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Preface 



This book describes the assembly language supported by the compiler sys- 
tem, it's syntax rules, and shows how to write some assembly programs. 
For information about assembling and linking programs written in assem- 
bler language, see the IRIS-4D Series Compiler Guide. 

The assembler converts assembly language statements into machine code. 
In most assembly languages, each instruction corresponds to a single ma- 
chine instruction; however, some assembly language instructions can gener- 
ate several machine instructions. This feature results in assembly programs 
that can run without modification on future machines, which might have 
different machine instructions. See Appendix B for more information 
about assembler instructions that generate multiple machine instructions. 



Who Should Read This Book? 

This book assumes that you are an experienced assembly language pro- 
grammer. 

The assembler exists primarily to produce object modules from the assem- 
bly instructions that the C and Fortran 77 compilers generate. It therefore 
lacks many functions normally present in assemblers. Therefore, we rec- 
ommend that you use the assembler only when you need to: 

• maximize the efficiency of a routine, which might not be possible in C 
or Fortran 77— for example, to write low-level I/O drivers 

• access machine functions unavailable from high-level languages or sat- 
isfy special constraints such as restricted register usage 

• change the operating system 

• change the compiler system 
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What Does This Book Cover? 

This book has these chapters: 

• Chapter 1— Registers describes the format for the general registers, g" 
the special registers, and the floating point registers. \ 

• Chapter 2— Addressing describes how addressing works. 

• Chapter 3— Exceptions describes exceptions you might encounter with 
assembly programs. 

• Chapter 4— Lexical Conventions describes the lexical conventions 
that the assembler follows. 

• Chapter 5— Instruction Set describes the main processor's instruction 
set, including notation, load and store instructions, computational in- 
structions, and jump and branch instructions. 

• Chapter 6— Coprocessor Instruction Set describes the coprocessor 
instruction sets. 

• Chapter 7— Linkage Conventions describes linkage conventions for all 
supported high-level languages. It also discusses memory allocation I 
and register use. V 

• Chapter 8— Pseudo-Operations describes the assembler's pseudo-op- 
erations (directives) . 

• Chapter 9— Object File Format provides an overview of the compo- 
nents comprising the object file and describes the headers and sections 
of the object file. 

• Chapter 10— The Symbol Table describes the purpose of the Symbol 
Table and the format of entries in the table. This chapter also lists 
the symbol table routines that are supplied. 

• Appendix A summarizes all instructions. 



Appendix B describes instructions that generate more than one ma- 
chine instruction. 
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1. Registers 



Chapter 1 discusses the registers and describes how memory organization 
affects them. Refer to Chapter 7 for information regarding register use 
and linkage. 

The machine has these registers: 

• general registers, which are always one word wide 

• coprocessor registers (for example, floating point registers) 

• two special registers that hold the results of multiplication and division 
instructions 

You must use general registers where the assembly instructions expect gen- 
eral registers and floating point registers where the assembly instructions 
expect floating point registers. If you confuse the two, the assembler is- 
sues an error message. 



1.1 Register Format 



A machine's byte ordering scheme (or endian issues) affects memory or- 
ganization and defines the relationship between address and byte position 
of data in memory. R2000 machines can be big-endian or little-endian. 
Big-endian machines store the sign bit in the lowest address byte. Little- 
endian machines store the sign bit in the highest address byte. The R2000 
processors used in the IRIS-4D Series workstations are configured as big- 
endian. 
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1.1.1 Big-Endian Machines 

Big-endian machines number the bytes of a word from to 3. Byte 
holds the sign and most significant bits. 

For halfwords, big-endian machines number the bytes from to 1. Byte 
holds the sign and most significant bits. 

Big-endian machines number the bits of each byte from to 7, using this 
format: 

• Bit holds the most significant bit. 

• Bit 7 holds the least significant bit. 



1.2 General Registers 

Each general register has 32 bits. The assembler reserves all register 
names, and you must use lowercase for the names. All register names 
start with a dollar sign ($). 

The general registers have the names $0..$31. By including the file reg- 
def.h (use #include regdef.h) in your program, you can use software 
names for some general registers. The operating system and the assembler 
use the general registers $1, $26, $27, $28, and $29 for specific purposes. 
(NOTE: Attempts to use these general registers in other ways can produce 
unexpected results.) If a program uses the names $1, $26, $27, $28, $29 
rather than the names $at, $ktO, $ktl, $gp, $sp respectively, the assem- 
bler issues warning messages. 

General register $0 always contains the value 0. All other general registers 
are equivalent, except that general register $31 also serves as the implicit 
link register for jump and link instructions. See Chapter 7 for a descrip- 
tion of register assignments. 



( 
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Software Name 
Register Name (from regdef . h) Use and Linkage 



$0 
$at 

$2. .$3 



$4. .$7 



vO-vl 



a0-a3 



always has the value 

reserved for the assembler 

used for expression evaluations and to hold 
the integer type function results. Also used 
to pass the static link when calling nested 
procedures. 

used to pass the first 4 words of integer 
type actual arguments, their values are 
not preserved across procedure calls 



*. .$15 



t0-t7 



temporary registers used for expression evalu- 
ations; their values aren't preserved across 
procedure calls. 



$16. .$23 



$24.. $25 



$31 



s0-s7 



t8-t9 



$kt0..$ktl 


kO-kl 


$28 or $gp 


gP 


$29 or $sp 


sp 


$30 or $fp 


fp 



saved registers . Their values must be pre- 
served across procedure calls. 

temporary registers used for expression evalu- 
ations; their values aren't preserved across 
procedure calls. 

c 
reserved for the operating system kernel 

contains the global pointer 

contains the stack pointer 

contains the frame pointer (if needed) ; 
otherwise a saved register (like s0-s7) 

contains the return address and used 
for expression evaluation 



Table 1-1. General Registers 
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1.3 Special Registers 

The machine has two 32 bit special registers. The hi and lo special regis- 
ters hold the results of the multiplication (mult and multu) and division 
(div and divu) instructions. g- 

You usually do not need to refer explicitly to these special registers. In- 
structions that use the special registers refer to them automatically. 

Nam© Description 

hi Multiply/Divide special register holds the most significant 

32 bits of multiply, remainder of divide 

lo Multiply/Divide special register holds the least significant 

32 bits of multiply, quotient of divide 



Table 1-2. Special Registers 



1.4 Floating Point Registers ^ 

There are 32 32-bit (or 16 64-bit) floating point registers on the R2000 
processor, numbered $f0..$f31. All references to these registers by float- 
ing point instructions must be to an even register, so most applications 
should use even-based register pairs as well as double-precision floating 
point values. Chapter 7 further describes floating point register use. 
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Double Precision" 





I** — Single Precision H 




$f0: 


$f0 


$f1 


$f2: 


$f2 


$f3 


$f4: 


$f4 


$f5 


$f6: 


$f6 


$f7 


$f8: 


$f8 


$f9 


$f10: 


$f10 


$f11 


$g12 


$g12 


$f13 


$f14: 


$f14 


$f15 


$f16: 


$f16 


$f17 


$f18: 


$f18 


$f19 


$f20: 


$f20 


$f21 


$f22: 


$f22 


$f23 


$f24: 


$f24 


$f25 


$f26: 


$f26 


$f27 


$f28: 


$f28 


$f29 


$f30: 


$f30 


$f31 



' 32 bits ■ 



64 bits ' 



Figure 1-1. Floating Point Register Set 
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2. Addressing 



Chapter 2 describes the formats that you can use to specify addresses. 
The machine uses a byte addressing scheme. Access to half words requires 
alignment on even byte boundaries, and access to words requires align- 
ment on byte boundaries that are divisible by four. Any attempt to ad- 
dress a data item that does not have the proper alignment causes an align- 
ment exception. 

The unaligned assembler load and store instructions may generate multiple 
machine language instructions. They do not raise alignment exceptions. 
These instructions load and store unaligned data: 

• load word left (lwl) 

• load word right (lwr) 

• store word left (swl) 

• store word right (swr) 

• unaligned load word (ulw) 

• unaligned load halfword (ulh) 

• unaligned load halfword unsigned (ulhu) 

• unaligned store word (usw) 

• unaligned store halfword (ush) 

• unaligned store halfword unsigned (ushu) 
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These instructions load and store aligned data: 

• load word (lw) 

• load halfword (lh) 

• load halfword unsigned (Ihu) 

• load byte (lb) 

• load byte unsigned (lbu) 

• store word (sw) 

• store halfword (sh) 

• store byte (sb) 

2.1 Address Formats 

The assembler accepts these formats for addresses: 



Format 



(base register) 

expression 

expression (base register) 

relocatable-symbol 
relocatable-symbol - expression 



Address 



base address (zero 
offset assumed) 

absolute address 
based address 

relocatable 
address 

relocatable address 



relocatable-symbol ± expression (index register) indexed relocatable 

address 



Table 2-1 . Formats for Addresses 



( 
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2.2 Address Descriptions 



The assembler accepts any combination of the constants and operations 
described in Chapter 4 for expressions in address descriptions. Table 2-2 
describes expressions and their address descriptions. 



Expression 



Address Description 



( base-register ) 



Specifies an indexed address, 
which assumes a zero offset. The 
base-register's contents specify the 
address. 



expression 



expression ( base-register ) 



relocatable-symbol 



relocatable-symbol ± expression 



Specifies an absolute address. The 
assembler generates the most lo- 
cally efficient code for referencing 
a value at the specified address. 

Specifies a based address. To get 
the address, the machine adds the 
value of the expression to the con- 
tents of the base-register. 

Specifies a relocatable address. 
The assembler generates the neces- 
sary instruction (s) to address the 
item and generates relocatable in- 
formation for the link editor. 

Specifies a relocatable address. To 
get the address, the assembler adds 
or subtracts the value of the ex- 
pression, which has an absolute 
value, from the relocatable symbol. 
The assembler generates the neces- 
sary instruction (s) to address the 
item and generates relocatable in- 
formation for the link editor. If 
the symbol name does not appear 
as a label anywhere in the assem- 
bly, the assembler assumes that the 
symbol is external. 



Table 2-2. Address Descriptions 



Version 1.0 



Addressing 2-3 



Expression 



Address Description 



relocatable-symbol ( base-register ) Specifies an indexed relocatable ad- 
dress. To get the address, the ma- 
chine adds the index-register to 
the relocatable symbol's address. 
The assembler generates the neces- 
sary instruction (s) to address the 
item and generates relocatable in- 
formation for the link editor. If 
the symbol name does not appear 
as a label anywhere in the assem- 
bly, the assembler assumes that the 
symbol is external. 



relocatable-symbol ± expression 
( base-register ) 



Specifies an indexed relocatable 
address. To get the address, the 
assembler adds or subtracts the 
relocatable symbol, the expression, 
and the contents of the index-reg- 
ister. The assembler generates the 
necessary instruction (s) to address 
the item and generates relocation 
information for the link editor. If 
the symbol does not appear as a 
label anywhere in the assembly, the 
assembler assumes that the symbol 
is external. 



( 



Table 2-2. Address Descriptions (continued) 
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3. Exceptions 



Chapter 3 describes the exceptions that you can encounter while running 
assembly programs. The machine detects some exceptions directly, and 
the assembler inserts specific tests that signal other exceptions. This chap- 
ter lists only those exceptions that occur most frequently. 



3.1 Main Processor Exceptions 

For the assembly language programmer, these are the most common main 
processor exceptions: 

• address error exceptions, which occur when the machine references a 
data item that is not on its proper memory alignment or when an ad- 
dress is invalid for the executing process 

• overflow exceptions, which occur when arithmetic operations compute 
signed values and the destination lacks the precision to store the result 

• bus exceptions, which occur when an address is invalid for the execut- 
ing process 

• divide-by-zero exceptions, which occur when a divisor is zero 
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3.2 Floating Point Exceptions 

These are the floating point exceptions (not implemented for first release 
of the IRIS-4D Series): 

• invalid operation exceptions 

o magnitude subtraction of infinities, for example: +00 — oo 

o multiplication of by 00 with any signs 

o division of 0/0 or 00/00 with any signs 

o conversion of a binary floating-point number to an integer format 
when an overflow or the operand value for the infinity or NaN 
precludes a faithful representation in the format (see Chapter 6) 

o comparison of predicates that have unordered operands, and that 
involve Greater Than or Leas Than without Unordered. 

o any operation on a signaling NaN 

• divide-by-zero exceptions 

• overflow exceptions— these occur when a rounded floating point result 
exceeds the destination format's largest finite number 

• underflow exceptions— these occur when a result has lost accuracy and 

also when a nonzero result is between ±2 Emm (plus or minus 2 to 
the minimum expressable exponent) . 

• inexact exceptions 



( 
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4. Lexical Conventions 

Chapter 4 discusses lexical conventions for these topics: 

• tokens 

• comments 

• identifiers 

• constants 

• multiple lines per physical line 

• sections and location counters 

• statements 

• expressions 

This chapter uses the following notation to describe syntax: 

• | (vertical bar) means "or" 

• [ ] (square brackets) enclose options 

• ± indicates both addition and subtraction operations 
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4.1 Tokens 

The assembler has these tokens: 

• identifiers 

• constants 

• operators 

The assembler lets you put blank characters and tab characters anywhere 
between tokens; however, it does not allow these characters within tokens 
(except for character constants). A blank or tab must separate adjacent 
identifiers or constants that are not otherwise separated. 



4.2 Comments 

The pound sign character (#) introduces a comment. Comments that start 

with a # extend through the end of the line on which they appear. You 

can also use C-language notation /*...*/ to delimit comments. £ 

The assembler uses cpp (the C language preprocessor) to preprocess as- 
sembler code. Because cpp interprets #s in the first column as pragmas 
(compiler directives), do not start a # comment in the first column. 



4.3 Identifiers 

An identifier consists of a case-sensitive sequence of alphanumeric charac- 
ters, including these: 

• . (period) 

• _ (underscore) 

• $ (dollar sign) | 

Identifiers can be up to 31 characters long, and the first character cannot 
be numeric. 
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If an identifier is not defined to the assembler (only referenced), the as- 
sembler assumes that the identifier is an external symbol. The assembler 
treats the identifier as if a .globl pseudo-operation was encountered (see 
Chapter 8) . If the identifier is defined to the assembler and the identifier 
has not been specified as global, the assembler assumes that the identifier 
is a local symbol. 



4.4 Constants 

The assembler has these constants: 

• scalar constants 

• floating point constants 

• string constants 



4.4.1 Scalar Constants 

The assembler interprets all scalar constants as twos complement numbers. 
Scalar constants can be any of the digits 0123456789abcdefABCDEF. 

Scalar constants can be one of these constants: 

• decimal constants, which consist of a sequence of decimal digits with- 
out a leading zero 

• hexadecimal constants, which consist of the characters Ox (or OX ) 
followed by a sequence of digits 

• octal constants, which consist of a leading zero followed by a sequence 
of digits in the range 0..7 
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4.4.2 Floating Point Constants 

Floating point constants can appear only in .float and .double pseudo-op- 
erations (directives)— see Chapter 8. Floating point constants follow this 
form: 

±dl[.d2] [e|E±dS] 

Where: 

• dl is written as a decimal integer and denotes the integral part of the 
floating point value 

• d2 is written as a decimal integer and denotes the fractional part of 
the floating point value 

• d3 is written as a decimal integer and denotes a power of 10 

• the "+" symbol is optional 
For example: 

21.73E-3 

represents the number .02173. 



4.4.3 String Constants 

String constants begin and end with double quotation marks ("). 

The assembler observes C language backslash conventions. For octal no- 
tation, the backslash conventions require three characters when the next 
character could be confused with the octal number. For hexadecimal no- 
tation, the backslash conventions require two characters when the next 
character could be confused with the hexadecimal number (i.e., use a 
for the first character of a single character hex number) . 

The assembler follows the backslash conventions listed in Table 4-1: 



( 



( 



( 
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Convention 


Meaning 


\a 


alert (0x07) 


\b 


backspace (0x08) 


\f 


form feed (0x0 c) 


\n 


newline (0x0a) 


\r 


carriage return (OxOd) 


\t 


horizontal tab (0x09) 


\v 


vertical feed (0x0b) 


\\ 


backslash (0x5 c) 


\" 


quotation mark (0x22) 


Y 


single quote (0x27) 


\000 


character whose octal value is 000 


\Xnn 


character whose hexadecimal value is nn 



Tabie 4-1. Backslash Conventions 



4.5 Multiple Lines Per Physical Line 

You can include multiple statements on the same line by separating the 
statements with semicolons. The assembler does not recognize semicolons 
as separators when they follow comment symbols (# or /*). 
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4.6 Sections and Location Counters 



Assembled code and data fall in one of six sections as illustrated in Figure 
4-1: 



( 



.text 



jiiiisiii 



.data 



iilialal 



;sbss 



l&ssl 



text section 



read-only data section 



data section 



small data section, addressed 
through register $gp 



small bss section, addressed 
through register $gp 



bss (block started by storage) 
section, which holds zero- 
initialized data 



( 



Figure 4-1. Location Counters 



(For more information on section data, see Chapter 9 of this manual.) 

The assembler always generates the text section before other sections. 
Additions to the text section happen in four-byte units. Each section has 
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an implicit location counter, which begins at zero and increments by one 
for each byte assembled in the section. 

The bss section holds zero-initialized data. If a Jcomm pseudo-op de- 
fines a variable (see Chapter 8), the assembler assigns that variable to the 
bss (block started by storage) section or to the sbss (short block started 
by storage) section depending on the variable's size. The default variable 
size for sbss is 512 or fewer bytes. 

The command line option -G for each compiler (C, Pascal, Fortran 77, 
or the assembler) , can increase the size of sbss to cover all but extremely 
large data items. The link editor issues an error message when the -G 
value gets too large. If a -G value is not specified to the compiler, 512 is 
the default. Items smaller than, or equal to, the specified size go in sbss. 
Items greater than the specified size go in bss. 

Because you can address items much more quickly through $gp than 
through a more general method, put as many items as possible in sdata or 
sbss. The size of sdata and sbss combined must not exceed 64K bytes. 



4.7 Statements 

Each statement consists of an optional label, an operation code, and the 
operand(s). The machine allows these statements: 

• null statements 

• keyword statements 

4.7.1 Label Definitions 

A label definition consists of an identifier followed by a colon. Label 
definitions assign the current value and type of the location counter to the 
name. An error results when the name is already defined, the assigned 
value changes the label definition, or both conditions exists. 

Label definitions always end with a colon. You can put a label definition 
on a line by itself. 

A generated label is a single numeric value (1...255). To reference a 
generated label, put an f (forward) or a b (backward) immediately after 
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the digits. The reference tells the assembler to look for the nearest gener- 
ated label that corresponds to the number in the lexically forward or back- 
ward direction. 



4,7.2 Null Statements 

A null statement is an empty statement that the assembler ignores. Null 
statements can have label definitions. For example, this line has three 
null statements in it: 

labell: ; ; 



4.7.3 Keyword Statements 

A keyword statement begins with a predefined keyword. The syntax for 
the rest of the statement depends on the keyword. All instruction op- 
codes are keywords. All other keywords are assembler pseudo-operations 
(directives) . 



c 



4.8 Expressions 

An expression is a sequence of symbols that represent a value. Each ex- 
pression and its result have data types. The assembler does arithmetic in 
twos complement integers with 32 bits of precision. Expressions follow 
precedence rules and consist of: 

• operators 

• identifiers 

• constants 

Also, you may use a single character string in place of an integer within an 
expression. Thus: 

.byte "a M ; .word "a "+0x19 1 / 

is equivalent to 

.byte 0x61 ; .word 0x7a 
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4.8.1 Precedence 

Unless parentheses enforce precedence, the assembler evaluates all opera- 
tors of the same precedence strictly from left to right. Because parenthe- 
ses also designate index-registers, ambiguity can arise from parentheses in 
expressions. To resolve this ambiguity, put a unary + in front of parenthe- 
ses in expressions. 

The assembler has three precedence levels, which are listed in Table 4-2 
from lowest to highest precedence: 



least binding, 

lowest precedence: binary +. - 

binary *, /, %, «, », A , &, 
most binding 

highest precedence: unary -, +, ~ 

Table 4-2. Precedence Levels 



NOTE: The assembler's precedence scheme differs from that of the C 
language. 



4.8.2 Expression Operators 

For expressions, you can rely on the precedence rules, or you can group 
expressions with parentheses. The assembler has these operators: 
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Operator 


Meaning 


+ 


addition 


- 


subtraction 


* 


multiplication 


/ 


division 


% 


remainder 


« 


shift left 


» 


shift right (sign NOT extended) 


- 


bitwise EXCLUSIVE OR 


& 


bitwise AND 


| 


bitwise OR 


- 


minus (unary) 


+ 


identity (unary) 


~ 


complement 



( 



Table 4-3. Operators 



4.8.3 Data Types 

The assembler manipulates several types of expressions. Each symbol you 
reference or define belongs to one of the categories listed in Table 4-4: 



( 
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Type 



Description 



undefined Any symbol that is referenced but not defined be- 

comes global undefined, and this module will at- 
tempt to import it. The assembler uses 32-bit ad- 
dressing to access these symbols. (Declaring such 
a symbol in a .globl pseudo-op merely makes its 
status clearer). 

sundefined A symbol defined by a .extern pseudo-op be- 

comes global small undefined if its size is greater 
than zero but less than the number of bytes speci- 
fied by the -G option on the command line (which 
defaults to 512). The linker places these symbols 
within a 64k byte region pointed to by the $gp 
register, so that the assembler can use economical 
16-bit addressing to access them. 

absolute A constant defined in an "=" expression. 

text The text section contains the program's instruc- 

tions, which are not modifiable during execution. 
Any symbol defined while the .text pseudo-op is 
in effect belongs to the text section. 

data The data section contains memory which the 

linker can initialize to nonzero values before your 
program begins to execute. Any symbol defined 
while the .data pseudo-op is in effect belongs to 
the data section. The assembler uses 32-bit ad- 
dressing to access these symbols. 

sdata This category is similar to data, except that defin- 

ing a symbol while the .sdata ("small data") 
pseudo-op is in effect causes the linker to place it 
within a 64k byte region pointed to by the $gp 
register, so that the assembler can use economical 
16-bit addressing to access it. 



Table 4-4. Data Types 
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Type Description 



rdata Any symbol defined while the .rdata pseudo-op is 

in effect belongs to this category, which is similar 
to data, but may not be modified during execu- 
tion. 

bss and sbss The bss and sbss sections consist of memory 

which the kernel loader initializes to zero before 
your program begins to execute. Any symbol de- 
fined in a .comm or .Icomm pseudo-op belongs to 
these sections (except that a .data, .sdata, or 
.rdata pseudo-op can override a .comm direc- 
tive) . If its size is less than the number of bytes 
specified by the -G option on the command line 
(which defaults to 512), it belongs to sbss ("small 
bss"), and the linker places it within a 64k byte 
region pointed to by the $gp register so that the 
assembler can use economical 16-bit addressing to 
access it. Otherwise, it belongs to bss and the 
assembler uses 32-bit addressing. 

Local symbols in bss or sbss defined by .lcomm 
are allocated memory by the assembler; global 
symbols are allocated memory by the link editor; 
and symbols defined by .comm are overlaid upon 
like-named symbols (in the fashion of Fortran 
"COMMON" blocks) by the link editor. 



Table 4-4. Data Types (continued) 

Symbols in the undefined and small undefined categories are always global 
(that is, they are visible to the link editor and can be shared with other mod- 
ules of your program). Symbols in the absolute, text, data, sdata, rdata, bss, 
and sbss categories are local unless declared in a .globl pseudo-op. 



( 



c 



c 



4-12 Assembly Language Programmer's Guide IRIS-4D Series 



4.8.4 Type Propagation in Expressions 

When expression operators combine expression operands, the result's type 
depends on the types of the operands and on the operator. Expressions 
follow these type propagation rules: 

• If an operand is undefined, the result is undefined. 

• If both operands are absolute, the result is absolute. 

• If the operator is + and the first operand refers to a relocatable text- 
section, data-section, bss-section, or an undefined external, the result 
has the postulated type and the other operand must be absolute. 

• If the operator is - and the first operand refers to a relocatable 
text-section, data-section, or bss-section symbol, the second operand 
can be absolute and the result has the first operand's type; or the sec- 
ond operand can have the same type as the first operand and the re- 
sult is absolute. If the first operand is external undefined, the second 
operand must be absolute. 

• The operators * , /, % , « , » , ~, A , & , and | apply only 
to absolute symbols. 
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( 



5. Instruction Set 



Chapter 5 describes instruction notation and discusses assembler instruc- 
tions for the main processor. Chapter 6 describes coprocessor notation 
and instructions. 

The assembler has the classes of instructions for the main processor listed 
in Table 5-1: 



Instruction 



Description 



Load and Store Instructions 



Computational Instructions 



Jump and Branch Instructions 



Coprocessor Interface 



Special Instructions 



These instructions load immediate 
values and move data between 
memory and general registers. 

These instructions do arithmetic 
and logical operations for values in 
registers. 

These instructions change program 
control flow. 

These instructions provide standard 
interfaces to the coprocessors. 

These instructions do miscellaneous 
tasks. 



Table 5-1. Instruction Classes 
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5.1 Instruction Notation 



The tables in Chapter 5 list the assembler format for each load, store, 
computational, jump, branch, coprocessor, and special instruction. The 
format consists of an op-code and a list of operand formats. The tables 
list groups of closely related instructions; for those instructions, you can 
use any op-code with any specified operand. Operands can take any of 
these formats: 

• memory references— for example a relocatable symbol +/- an expres- 
sion (register) 

• expressions (for immediate values) 

• two or three operands— for example, add $3, $4 is the same as add 



5.2 Load and Store Instructions 

The machine has general-purpose load and store instructions. 

5.2.1 Load and Store Formats 

Table 5-2 lists operands and their descriptions. Table 5-3 shows the for- 
mats of available load and store instructions. 

Operand Description 

destination the destination register 

address a symbolic expression (see Chapter 2) 

source the source register 

expression an absolute value 

Table 5-2. Load and Store Operands 



( 
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Description 


Op-code 


Operands 


Load Address 


la 


destination, address 


Load Byte 


lb 




Load Byte Unsigned 


lbu 




Load Halfword 


lh 




Load Halfword Unsigned 


lhu 




Load Word 


lw 




Load Word Left 


lwl 




Load Word Right 


lwr 




Load Double 


Id 




Unaligned Load Halfword 


ulh 




Unaligned Load Halfword Unsigned ulhu 




Unaligned Load Word 


ulw 




Load Immediate 


li 


destination, expression 


Load Upper Immediate 


lui 




Store Byte 


sb 


source, address 


Store Double 


sd 




Store Halfword 


sh 




Store Word Left 


swl 




Store Word Right 


swr 




Store Word 


sw 




Unaligned Store Halfword 


ush 




Unaligned Store Word 


usw 





Table 5-3. Load and Store Instruction Formats 
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5.2.2 Load Instruction Descriptions 

For all machine load instructions, the effective address is the 32-bit 
twos-complement sum of the contents of the index-register and the (sign- 
extended) 16-bit offset. Instructions that have symbolic labels imply an 
index-register, which the assembler determines. The assembler supports 
additional load instructions, which can produce multiple machine instruc- 
tions. NOTE: Load instructions can generate many code sequences for 
which the link editor must fix the address by resolving external data items. 



Instruction Name 



Description 



Load Address (la) 
Load Byte (lb) 



Load Byte Unsigned (lbu) 



Loads the destination register with 
the effective address of the speci- 
fied data item. 

Loads the least significant byte of 
the destination register with the 
contents of the byte that is at the 
memory location specified by the 
effective address. The machine 
treats the loaded byte as a signed 
value: bit seven is extended to fill 
the three most significant bytes. 

Loads the least significant byte of 
the destination register with the 
contents of the byte that is at the 
memory location specified by the 
effective address. Because the ma- 
chine treats the loaded byte as an 
unsigned value, it fills the three 
most significant bytes of the desti- 
nation register with zeros. 



( 



Table 5-4. Load Instruction Descriptors 



( 
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Instruction Name 



Description 



Load Double (Id) 



Load Halfword (lh) 



Load Halfword Unsigned (Ihu) 



Loads the register pair (destination 
and destination + 1) with the two 
successive words specified by the 
address. The destination register 
must be the even register of the 
pair. When the address is not on 
a word boundary, the machine sig- 
nals an address error exception. 
NOTE: For compatibility with fu- 
ture machines, we recommend the 
use of double word alignment for 
all double word operands. 

Loads the two least significant 
bytes of the destination register 
with the contents of the halfword 
that is at the memory location 
specified by the effective address. 
The machine treats the loaded 
halfword as a signed value. If the 
effective address is not even, the 
machine signals an address error 
exception. 

Loads the least significant bits of 
the destination register with the 
contents of the halfword that is at 
the memory location specified by 
the effective address. Because the 
machine treats the loaded halfword 
as an unsigned value, it fills the 
two most significant bytes of the 
destination register with zeros. If 
the effective address is not even, 
the machine signals an address er- 
ror exception. 



Tabie 5-4. Load Instruction Descriptors (continued) 
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Instruction Name 



Description 



Load Immediate (li) 



Load Upper Immediate (lui) 



Load Word (lw) 



Loads the destination register with 
the value of an expression that can 
be computed at assembly time. 

NOTE: Load Immediate can 

generate any efficient code se- 
quence to put a desired value in 
the register. 

Loads the most significant half of a 
register with the expression's value, 
The machine fills the least signifi- 
cant half of the register with zeros. 
The expression's value must be in 
the range -32768... 65535. 

Loads the destination register with 
the contents of the word that is at 
the memory location. The ma- 
chine replaces all bytes of the reg- 
ister with the contents of the 
loaded word. 

The machine signals an address er- 
ror exception when the effective 
address is not divisible by four. 



( 



( 



Table 5-4. Load Instruction Descriptors (continued) 
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Instruction Name 



Description 



Load Word Left (lwl) 



Load Word Right (lwr) 



Loads the sign— that is, Load 
Word Left loads the destination 
register with the most significant 
bytes of the word specified by the 
effective address. The effective 
address must specify the byte con- 
taining the sign. In a big-endian 
machine, the effective address 
specifies the lowest numbered byte, 
and in a little-endian machine the 
effective address specifies the high- 
est numbered byte. 

Only the bytes which share the 
same aligned word in memory are 
merged into the destination regis- 
ter. 

Loads the lowest precision bytes— 
that is, Load Word Right loads 
the destination register with the 
least significant bytes of the word 
specified by the effective address. 
The effective address must specify 
the byte containing the least signifi- 
cant bits. In a big-endian ma- 
chine, the effective address speci- 
fies the highest numbered byte, 
and in a little-endian machine the 
effective address specifies the low- 
est numbered byte. 

Only the bytes which share the 
same aligned word in memory are 
merged into the destination regis- 
ter. 



Table 5-4. Load Instruction Descriptors (continued) 
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Instruction Name 



Description 



Unaligned Load Halfword (ulh) 



Unaligned Load Halfword 
Unsigned (ulhu) 



Unaligned Load Word (ulw) 



Loads a halfword into the destina- 
tion register from the specified ad- 
dress and extends the sign of the 
halfword. Unaligned Load 
Halfword loads a halfword regard- 
less of the halfword's alignment in 
memory. 

Loads a halfword into the destina- 
tion register from the specified ad- 
dress and zero extends the 
halfword. Unaligned Load 
Halfword Unsigned loads a 
halfword regardless of the 
halfword* s alignment in memory. 

Loads a word into the destination 
register from the specified address. 
Unaligned Load Word loads a 
word regardless of the word's align- 
ment in memory. 



( 



( 



Table 5-4. Load Instruction Descriptors (continued) 



5.2,3 Store Instruction Descriptions 

For all machine store instructions, the effective address is the 32-bit twos- 
complement sum of the contents of the index-register and the (sign-ex- 
tended) 16-bit offset. The assembler supports additional store instruc- 
tions, which can produce multiple machine instructions. Instructions that 
have symbolic labels imply an index-register, which the assembler deter- 
mines. 



( 
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Instruction Name 



Description 



Store Byte (sb) 



Store Halfword (sh) 



Store Word (sw) 



Store Double (sd) 



Stores the contents of the source 
register's least significant byte in 
the byte specified by the effective 
address. 

Stores the two least significant 
bytes of the source register in the 
halfword that is at the memory lo- 
cation specified by the effective ad- 
dress. The effective address must 
be divisible by two, otherwise the 
machine signals an address error 
exception. 

Stores the contents of a word from 
the source register in the memory 
location specified by the effective 
address. The effective address 
must be divisible by four, otherwise 
the machine signals an address er- 
ror exception. 

Stores the contents of the register 
pair in successive words, which the 
address specifies. The source reg- 
ister must be the even register of 
the pair, and the storage address 
must be word aligned. NOTE: 
For compatibility with future ma- 
chines, we recommend that you 
use double word alignment. 



Table 5-5. Store Instruction Descriptors 
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Instruction Name 



Description 



Store Word Left (swl) 



Store Word Right (swr) 



Unaligned Store Halfword (ush) 



Unaligned Store Word (usw) 



Stores the most significant bytes of 
a word in the memory location 
specified by the effective address. 
The contents of the word at the 
memory location, specified by the 
effective address, are shifted right 
so that the leftmost byte of the un- 
aligned word is in the addressed 
byte position. The stored bytes re- 
place the corresponding bytes of 
the effective address. The effec- 
tive address's last two bits deter- 
mine how many bytes are involved. 

Stores the least significant bytes of 
a word in the memory location 
specified by the effective address. 
The contents of the word at the 
memory location, specified by the 
effective address, are shifted left so 
that the right byte of the unaligned 
word is in the addressed byte posi- 
tion. The stored bytes replace the 
corresponding bytes of the effective 
address. The effective address's 
last two bits determine how many 
bytes are involved. 

Stores the contents of the two least 
significant bytes of the source regis- 
ter in a halfword that the address 
specifies. The machine does not 
require alignment for the storage 
address. 

Stores the contents of the source 
register in a word specified by the 
address. The machine does not 
require alignment for the storage 
address. 



( 



( 



( 



Table 5-5. Store Instruction Descriptors (continued) 
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5.3 Computational Instructions 

The machine has general-purpose and coprocessor-specific computational 
instructions (for example, the floating point coprocessor). This part of the 
book describes general-purpose computational instructions. 



5.3.1 Computational Formats 

Table 5-6 shows computational operands and their descriptions. Table 
5-7 shows the formats of computational instructions. 



Operand Description 

destination/srcl the destination register is 

also source register 1 

destination the destination register 

immediate tne immediate value 

srcl,src2 the source registers 

Table 5-6. Computational Operands 
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Description 


Op-code 


Operand 


Add (with overflow) 
Add (without overflow) 

AND 


add 

addu 

and 


destination, src 1 , src2 
destination/src 1 ,src2 
destination, src 1, 


Divide (signed) 
Divide (unsigned) 
EXCLUSIVE OR 


div 

divu 

xor 


immediate 
destination/src 1, 
immediate 


Multiply 


mul 




Multiply with Overflow 
Multiply with Overflow Unsigned 


mulo 
mulou 




NOT OR 


nor 




OR 


or 




Set Equal 


seq 




Set Greater 


sgt 




Set Greater/Equal 

Set Greater/Equal Unsigned 


sge 
sgeu 




Set Greater Unsigned 


sgtu 




Set Less 


sit 




Set Less/Equal 


sle 




Set Less/Equal Unsigned 


sleu 




Set Less Unsigned 


situ 




Set Not Equal 


sne 




Subtract (with overflow) 


sub 




Subtract (without overflow) 


subu 





c 



( 



Table 5-7. Computational Instruction Formats 



( 
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Description 


Op-code 


Operand 


Remainder (signed) 


rem 


destination, srcl, src2 


Remainder (unsigned) 


remu 


destination/srcl,src2 


Rotate Left 


rol 


destination, srcl, 


Rotate Right 


ror 


immediate 


Shift Right Arithmetic 


sra 


destination/srcl, 


Shift Left Logical 


sll 


immediate 


Shift Right Logical 


srl 




Absolute Value 


abs 


destination, srcl 


Negate (with overflow) 


neg 


destination/srcl 


Negate (without overflow) 


negu 




NOT 


not 




Move 


move 


destination, srcl 


Multiply 


mult 


srcl,src2 


Multiply (unsigned) 


multu 





Table 5-7. Computational Instruction Formats (continued) 

5.3.2 Computational Instruction Descriptions 

Table 5-8 shows the descriptions of computational instructions. 
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Instruction Name 



Description 



Absolute Value (abs) 



Add (with overflow) (add) 



Add (without overflow) (addu) 



AND (and) 



Computes the absolute value of the 
contents of srcl and puts the result 
in the destination register. If the 
value in srcl is -2147483648, the 
machine signals an overflow excep- 
tion. 

Computes the twos complement 
sum of two signed values. This in- 
struction adds the contents of srcl 
to the contents of src2, or it can 
add the contents of srcl to the im- 
mediate value. Add (with over- 
flow) puts the result in the destina- 
tion register. When the result can- 
not be extended as a 32-bit num- 
ber, the machine signals an over- 
flow exception. 

Computes the twos complement 
sum of two 32-bit values. This in- 
struction adds the contents of srcl 
to the contents of src2, or it can 
add the contents of srcl to the im- 
mediate value. Add (without 
overflow) puts the result in the 
destination register. Overflow ex- 
ceptions never occur. 

Computes the Logical AND of two 
values. This instruction ANDs 
(bit- wise) the contents of srcl with 
the contents of src2, or it can 
AND the contents of srcl with the 
immediate value. The immediate 
value is not sign extended. AND 
puts the result in the destination 
register. 



Table 5-8. Computational Instruction Descriptions 



( 



( 



( 
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Instruction Name 



Description 



Divide (signed) (div) 



Computes the quotient of two val- 
ues. Divide (with overflow) treats 
srcl as the dividend. The divisor 
can be src2 or the immediate 
value. The instruction divides the 
contents of srcl by the contents of 
src2, or it can divide srcl by the 
immediate value. It puts the quo- 
tient in the destination register. If 
the divisor is zero, the machine 
signals an error. The div instruc- 
tion rounds toward zero. Overflow 
is signaled when dividing 
-2147483648 by -1. 

NOTE: The special case 



div $0,srcl,src2 



Divide (unsigned) (divu) 



generates the real machine divide 
instruction and leaves the result in 
the hi/lo register. The hi register 
contains the remainder and the lo 
register contains the quotient. No 
checking for divide by zero is per- 
formed. 

Computes the quotient of two un- 
signed 32-bit values. Divide 
(without overflow) treats srcl as 
the dividend. The divisor can be 
src2 or the immediate value. This 
instruction divides the contents of 
srcl by the contents of src2, or it 
can divide the contents of srcl by 
the immediate value. Divide 
(without overflow) puts the quo- 
tient in the destination register. If 
the divisor is zero, the machine 
signals an exception. 



Table 5-8. Computational Instruction Descriptions (continued) 
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Instruction Name 



Description 



EXCLUSIVE OR (xor) 



Move (move) 
Multiply (mul) 



Multiply (mult) 



See the note for div concerning $0 
as a destination. 

Overflow exceptions never occur. 

Computes the XOR of two values. 
This instruction XORs (bit-wise) 
the contents of srcl with the con- 
tents of src2, or it can XOR the 
contents of srcl with the immedi- 
ate value. The immediate value is 
not sign extended. EXCLUSIVE 
OR puts the result in the destina- 
tion register. 

Moves the contents of srcl to the 
destination register. 

Computes the product of two val- 
ues. This instruction puts the 
32-bit product of srcl and src2, or 
the 32-bit product of srcl and the 
immediate value, in the destination 
register. The machine does not 
report overflow. 

NOTE: Use mul when you do not 
need overflow protection: it's often 
faster than mulo and mulou. For 
multiplication by a constant, the 
mul instruction produces faster ma- 
chine instruction sequences than 
mult or multu instructions can 
produce. 

Computes the 64-bit product of 
two 32-bit signed values. This in- 
struction multiplies the contents of 
srcl by the contents of src2 and 
puts the result in the hi and lo 
registers (see Chapter 1). No 
overflow is possible. 



( 



( 
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Table 5-8. Computational Instruction Descriptions (continued) 
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Instruction Name 



Description 



Multiply Unsigned (multu) 



Multiply with Overflow (mulo) 



NOTE: The mult instruction is a 
real machine language instruction. 

Computes the product of two un- 
signed 32-bit values. It multiplies 
the contents of srcl and the con- 
tents of src2 and puts the result in 
the hi and lo registers (see Chap- 
ter 1). No overflow is possible. 

NOTE: The multu instruction is a 
real machine language instruction. 

Computes the product of two 
32-bit signed values. Multiply 
with Overflow puts the 32-bit 
product of srcl and src2, or the 
32-bit product of srcl and the im- 
mediate value, in the destination 
register. When a overflow occurs, 
the machine signals an overflow 
exception. 

NOTE: For multiplication by a 
constant, mulo produces faster ma- 
chine instruction sequences than 
mult or multu can produce; how- 
ever, if you do not need overflow 
detection, use the mul instruction. 
It's often faster than mulo. 



Multiply with Overflow Unsigned 

(mulou) 



Computes the product of two 
32-bit unsigned values. Multiply 
with Overflow Unsigned puts the 
32-bit product of srcl and src2, or 
the product of srcl and the imme- 
diate value, in the destination reg- 
ister. This instruction treats the 
multiplier and multiplicand as 
32-bit unsigned values. When an 
overflow occurs, the machine sig- 
nals an overflow exception. 



Table 5-8. Computational Instruction Descriptions (continued) 
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Instruction Name 



Description 



Negate (with overflow) (neg) 



NOTE: For multiplication by a 
constant, mulou produces faster 
machine instruction sequences than g 
mult or multu can produce; how- I 
ever, if you do not need overflow 
detection, use the mul instruction. 
It's often faster than mulou. 

Computes the negative of a value. 
This instruction negates the con- 
tents of srcl and puts the result in 
the destination register. If the 
value in srcl is -2147483648, the 
machine signals an overflow excep- 
tion. 



Negate (without overflow) 

(negu) 



NOT (not) 



NOT OR (nor) 



Negates the integer contents of 
srcl and puts the result in the des- 
tination register. The machine 
does not report overflows. 

Computes the Logical NOT of a 
value. This instruction comple- 
ments (bit-wise) the contents of 
srcl and puts the result in the des- 
tination register. 

Computes the NOT OR of two val- 
ues. This instruction combines the 
the contents of srcl with the con- 
tents of src2 (or the immediate 
value) . NOT OR complements the 
result and puts it in the destination 
register. 



c 



Table 5-8. Computational Instruction Descriptions (continued) 
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Instruction Name 



Description 



OR (or) 



Remainder (signed) (rem) 



Computes the Logical OR of two 
values. This instruction ORs (bit- 
wise) the contents of srcl with the 
contents of src2, or it can OR the 
contents of srcl with the immedi- 
ate value. The immediate value is 
not sign extended. OR puts the 
result in the destination register. 

Computes the remainder of the di- 
vision of two unsigned 32-bit val- 
ues. The machine defines the re- 
mainder rem(i,j) as i-(j*div(i,j)) 
where j 7^ 0. Remainder (with 
overflow) treats srcl as the divi- 
dend. The divisor can be src2 or 
the immediate value. This instruc- 
tion divides the contents of srcl by 
the contents of src2, or it can di- 
vide the contents of srcl by the 
immediate value. It puts the re- 
mainder in the destination register. 
The rem instruction rounds toward 
zero, rather than toward negative 
infinity. For example, 
div(5,-3)=-l, and rem(5,-3)=2. 
If the divisor is zero, the machine 
signals an error. 
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Instruction Name 



Description 



Remainder (unsigned) (remu) 



Rotate Left (rol) 



Computes the remainder of the di- 
vision of two unsigned 32-bit val- 
ues. The machine defines the re- 
mainder rem(i,j) as i-(j*div(i,j)) 
where j 7^ 0. Remainder Un- 
signed treats srcl as the dividend. 
The divisor can be src2 or the im- 
mediate value. This instruction di- 
vides the contents of srcl by the 
contents of src2, or it can divide 
the contents of srcl by the imme- 
diate value. Remainder Unsigned 
puts the remainder in the destina- 
tion register. If the divisor is zero, 
the machine signals an error. 

Rotates the contents of a register 
left (toward the sign bit) . This in- 
struction inserts in the least signifi- 
cant bit any bits that were shifted 
out of the sign bit. The contents 
of srcl specify the value to shift, 
and the contents of src2 (or the 
immediate value) specify the 
amount to shift. Rotate Left puts 
the result in the destination regis- 
ter. If src2 (or the immediate 
value) is greater than 31, srcl 
shifts by (src2 MOD 32). 



( 
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Instruction Name 



Description 



Rotate Right (ror) 



Set Equal (seq) 



Set Greater (sgt) 



Set Greater/Equal (sge) 



Rotates the contents of a register 
right (toward the least significant 
bit) . This instruction inserts in the 
sign bit any bits that were shifted 
out of the least significant bit. The 
contents of srcl specify the value 
to shift, and the the contents of 
src2 (or the immediate value) 
specify the amount to shift. Ro- 
tate Right puts the result in the 
destination register. If src2 (or the 
immediate value) is greater than 
32, srcl shifts by src2 MOD 32. 

Compares two 32-bit values. If 
the contents of srcl equal the con- 
tents of src2 (or srcl equals the 
immediate value) this instruction 
sets the destination register to one; 
otherwise, it sets the destination 
register to zero. 

Compares two signed 32-bit values. 
If the contents of srcl are greater 
than the contents of src2 (or srcl 
is greater than the immediate 
value), this instruction sets the des- 
tination register to one; otherwise, 
it sets the destination register to 
zero. 

Compares two signed 32-bit values. 
If the contents of srcl are greater 
than or equal to the contents of 
src2 (or srcl is greater than or 
equal to the immediate value) , this 
instruction sets the destination reg- 
ister to one; otherwise, it sets the 
destination register to zero. 
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Instruction Name 



Description 



Set Greater/Equal Unsigned 

(sgeu) 



Set Greater Unsigned (sgtu) 



Set Less (sit) 



Set Less/Equal (sle) 



Compares two unsigned 32-bit val- 
ues. If the contents of srcl are 
greater than or equal to the con- 
tents of src2 (or srcl is greater 
than or equal to the immediate 
value), this instruction sets the des- 
tination register to one; otherwise, 
it sets the destination register to 
zero. 

Compares two unsigned 32-bit val- 
ues. If the contents of srcl are 
greater than the contents of src2 
(or srcl is greater than the imme- 
diate value), this instruction sets 
the destination register to one; oth- 
erwise, it sets the destination regis- 
ter to zero. 

Compares two signed 32-bit values. 
If the contents of srcl are less 
than the contents of src2 (or srcl 
is less than the immediate value), 
this instruction sets the destination 
register to one; otherwise, it sets 
the destination register to zero. 

Compares two signed 32-bit values. 
If the contents of srcl are less 
than or equal to the contents of 
src2 (or srcl is less than or equal 
to the immediate value), this in- 
struction sets the destination regis- 
ter to one; otherwise, it sets the 
destination register to zero. 



( 
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Instruction Name 



Description 



Set Less/Equal Unsigned (sleu) 



Set Less Unsigned (situ) 



Set Not Equal (sne) 



Shift Left Logical (sll) 



Compares two unsigned 32-bit val- 
ues. If the contents of srcl are 
less than or equal to the contents 
of src2 (or srcl is less than or 
equal to the immediate value) this 
instruction sets the destination reg- 
ister to one; otherwise, it sets the 
destination register to zero. 

Compares two unsigned 32-bit val- 
ues. If the contents of srcl are 
less than the contents of src2 (or 
srcl is less than the immediate 
value), this instruction sets the des- 
tination register to one; otherwise, 
it sets the destination register to 
zero. 

Compares two 32-bit values. If 
the contents of scrl do not equal 
the contents of src2 (or srcl does 
not equal the immediate value) , 
this instruction sets the destination 
register to one; otherwise, it sets 
the destination register to zero. 

Shifts the contents of a register left 
(toward the sign bit) and inserts 
zeros at the least significant bit. 
The contents of srcl specify the 
value to shift, and the contents of 
src2 or the immediate value specify 
the amount to shift. If src2 (or 
the immediate value) is greater 
than 31 or less than 0, srcl shifts 
by src2 MOD 32. 
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Instruction Name 



Description 



Shift Right Arithmetic (sra) 



Shift Right Logical (srl) 



Subtract (with overflow) (sub) 



Shifts the contents of a register 
right (toward the least significant 
bit) and inserts the sign bit at the 
most significant bit. The contents 
of srcl specify the value to shift, 
and the contents of src2 (or the 
immediate value) specify the 
amount to shift. If src2 (or the 
immediate value) is greater than 31 
or less than 0, srcl shifts by the 
result of src2 MOD 32. 

Shifts the contents of a register 
right (toward the least significant 
bit) and inserts zeros at the most 
significant bit. The contents of 
srcl specify the value to shift, and 
the contents of src2 (or the imme- 
diate value) specify the amount to 
shift. If src2 (or the immediate 
value) is greater than 31 or less 
than 0, srcl shifts by the result of 
src2 MOD 32. 

Computes the twos complement 
difference for two signed values. 
This instruction subtracts the con- 
tents of src2 from the contents of 
srcl, or it can subtract the con- 
tents of the immediate from the 
srcl value. Subtract puts the re- 
sult in the destination register. 
When the true result's sign differs 
from the destination register's sign, 
the machine signals an overflow 
exception. 



( 



( 



Tabie 5-8. Computational Instruction Descriptions (continued) 



( 



5-24 Assembly Language Programmer's Guide 



IRIS-4D Series 



Instruction Name 



Description 



Subtract (without overflow) 

(subu) 



Computes the twos complement 
difference for two 32-bit values. 
This instruction subtracts the con- 
tents of src2 from the contents of 
srcl, or it can subtract the con- 
tents of the immediate from the 
srcl value. Subtract Unsigned 
puts the result in the destination 
register. Overflow exceptions 
never happen. 



Table 5-8. Computational Instruction Descriptions (continued) 



5.4 Jump and Branch Instructions 



The jump and branch instructions let you change an assembly program's 
control flow. 



5.4.1 Jump and Branch Formats 

Table 5-9 shows jump and branch operands and their descriptions. Table 
5-10 shows the formats of jump and branch instructions. 

Operand Description 

Jimn i" n i zasgsssamasaBSBBsrssmBBiansaBBanMaBaBSssa i.n u nun i urn 

address an expression 

srcl,src2 the source registers 

label a symbol label 

immediate an expression with an absolute value 

Table 5-9. Jump and Branch Operands 
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Description 



Op-code Operand 



Jump 

Jump and Link 

Branch on Equal 

Branch on Greater 

Branch on Greater/Equal 

Branch on Greater/Equal Unsigned 

Branch on Greater Unsigned 

Branch on Less 

Branch on Less/Equal 

Branch on Less/Equal Unsigned 

Branch on Less Unsigned 

Branch on Not Equal 

Branch on Equal to Zero 
Branch on Greater/Equal Zero 
Branch on Greater Than Zero 
Branch on Less/Equal Zero 
Branch on Less Than Zero 
Branch on Not Equal to Zero 



J 
jal 



beq 

bgt 

bge 

bgeu 

bgtu 

bit 

ble 

bleu 

bltu 

bne 

beqz 

bgez 

bgtz 

blez 

bltz 

bnez 



Branch b 

Branch and Link bal 

Branch on Less Than Zero and Link bltzal 

Branch on Greater or Equal to Zero bgezal 
and Link 



address 
srcl 

srcl,src2, label 1 

srcl, immediate, label 



srcl, label 



( 



label 



Table 5-10. Jump and Branch Instruction Formats 
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5.4.2 Jump and Branch Instruction Descriptions 

In the following branch instructions, branch destinations must be defined 
in the source being assembled. Table 5-11 shows descriptions of jump 
and branch instructions. 



Instruction Name 



Description 



Branch (b) 



Branch and Link (bal) 



Branch on Equal (beq) 



Branch on Equal to Zero (beqz) 



Branch on Greater (bgt) 



Branch on Greater/Equal Unsigned 

(bgeu) 



Branches unconditionally to the 
specified label. 

Branches unconditionally to the 
specified label and puts the return 
address in general register $31. 

Branches to the specified label 
when the contents of srcl equal 
the contents of src2, or it can 
branch when the contents of srcl 
equal the immediate value. 

Branches to the specified label 
when the contents of srcl equal 
zero. 

Branches to the specified label 
when the contents of srcl are 
greater than the contents of src2, 
or it can branch when the contents 
of srcl are greater than the imme- 
diate value. The comparison treats 
the comparands as signed 32-bit 
values. 



Branches to the specified label 
when the contents of srcl are 
greater than or equal to the con- 
tents of src2, or it can branch 
when the contents of srcl are 
greater than or equal to the imme- 
diate value. The comparison treats 
the comparands as unsigned 32-bit 
values. 
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Instruction Name 



Description 



Branch on Greater/Equal Zero 

(bgez) 



Branches to the specified label 
when the contents of srcl are 
greater than or equal to zero. 



Branch on Greater/Equal Zero and Link 

(bgezal) Branches to the specified label 

when the contents of srcl are 
greater than or equal to zero and 
puts the return address in general 
register $31. 



Branch on Greater or Equal 

(bge) 



Branch on Greater Than Unsigned 

(bgtu) 



Branch on Greater Than Zero 

(bgtz) 



Branches to the specified label 
when the contents of srcl are 
greater than or equal to the con- 
tents of src2, or it can branch 
when the contents of srcl are 
greater than or equal to the imme- 
diate value. The comparison treats 
the comparands as signed 32-bit 
values. 



Branches to the specified label 
when the contents of srcl are 
greater than the contents of src2, 
or it can branch when the contents 
of srcl are greater than the imme- 
diate value. The comparison treats 
the comparands as unsigned 32-bit 
values. 



Branches to the specified label 
when the contents of srcl are 
greater than zero. 



( 



( 



Table 5-11. Jump and Branch Instruction Descriptions (continued) 
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Instruction Name 



Description 



Branch on Less (bit) 



Branch on Less/Equal Unsigned 

(bleu) 



Branch on Less/Equal Zero 

(blez) 



Branch on Less or Equal (ble) 



Branch on Less Than Unsigned 

(bltu) 



Branches to the specified label 
when the contents of srcl are less 
than the contents of src2, or it can 
branch when the contents of srcl 
are less than the immediate value. 
The comparison treats the com- 
parands as signed 32-bit values. 

Branches to the specified label 
when the contents of srcl are less 
than or equal to the contents of 
src2, or it can branch when the 
contents of srcl are less than or 
equal to the immediate value. The 
comparison treats the comparands 
as unsigned 32-bit values. 

Branches to the specified label 
when the contents of srcl are less 
than or equal to zero. The pro- 
gram must define the destination. 

Branches to the specified label 
when the contents of srcl are less 
than or equal to the contents of 
src2, or it can branch when the 
contents of srcl are less than or 
equal to the immediate value. The 
comparison treats the comparands 
as signed 32-bit values. 

Branches to the specified label 
when the contents of srcl are less 
than the contents of src2, or it can 
branch when the contents of srcl 
are less than the immediate value. 
The comparison treats the com- 
parands as unsigned 32-bit values. 
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Instruction Name 



Description 



Branch on Less Than Zero 

(bltz) 



Branches to the specified label 
when the contents of srcl are less 
than zero. The program must de- 
fine the destination. 



Branch on Less Than Zero and Link 



(bltzal) 



Branch on Not Equal (bne) 



Branch on Not Equal to Zero 

(bnez) 



Jump (j) 



Branches to the specified label 
when the contents of srcl are less 
than zero and puts the return ad- 
dress in general register $31. 

Branches to the specified label 
when the contents of srcl do not 
equal the contents of src2, or it 
can branch when the contents of 
srcl do not equal the immediate 
value. 



Branches to the specified label 
when the contents of srcl do not 
equal zero. 

Unconditionally jumps to a speci- 
fied location. A symbolic address 
or a general register specifies the 
destination. The instruction j $31 
returns from the a jal call instruc- 
tion. 



c 



( 
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Instruction Name Description 

Jump And Link (jal) Unconditionally jumps to a speci- 

fied location and puts the return 
address in general register $31. A 
symbolic address or a general regis- 
ter specifies the destination. The 
instruction jal procname transfers 
to procname and saves the return 
address. 

The machine does not allow a 
Jump and Link to register $31. 



Table 5-11. Jump and Branch Instruction Descriptions (continued) 

5.5 Special Instructions 

The main processor's special instructions do miscellaneous tasks. 

5.5.1 Special Formats 

Table 5-12 shows special format operands. Table 5-13 shows the formats 
of special instructions. 

Operand Description __^ 

register destination or source register 

breakcode value that determines the break type 

Table 5-12. Special Operands Instructions 
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Description 


Op-code 


Operand 


Break 


break 


breakcode 


Restore From Exception 


rfe 




Syscall 


syscall 




Move From HI Register 


mfhi 


register 


Move To HI Register 


mthi 




Move From LO Register 


mflo 




Move To LO Register 


mtlo 





( 



Table 5-13. Special Instruction Formats 
5.5.2 Special Instruction Descriptions 

Table 5-14 shows descriptions of special instructions. 



( 



Instruction Name 



Description 



Break (break) 



Move From HI Register (mfhi) 



Move From LO Register (mflo) 



Move To HI Register (mthi) 



Move To LO Register (mtlo) 



Unconditionally transfers control to 
the exception handler. The break- 
code operand is interpreted by 
software conventions. 

Moves the contents of the HI reg- 
ister to a general purpose register. 

Moves the contents of the LO reg- 
ister to a general purpose register. 

Moves the contents of a general 
purpose register to the HI register. 

Moves the contents of a general 
purpose register to the LO register. 



Table 5-14. Special Instruction Descriptions 
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Instruction Name Description 

Restore From Exception (rfe) Restores the previous interrupt 

callee and user/kernel state. This 
instruction can execute only in ker- 
nel state and is unavailable in user 
mode. 

Syscall (syscall) Causes a system call trap. The op- 

erating system interprets the infor- 
mation set in registers to determine 
what system call to do. 



Table 5-14. Special Instruction Descriptions (continued) 



5.6 Coprocessor Interface Instructions 

The coprocessor interface instructions provide standard ways to access the 
machine's coprocessors. 

Note: You cannot use coprocessor load and store instructions with the 
system control coprocessor (cpO). 



5.6.1 Coprocessor Interface Formats 

Table 5-15 shows coprocessor interface operands. Table 5-16 shows the 
formats of coprocessor interface descriptions. 
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Operand 



Description 



a coprocessor number in the range 0...3 
the destination coprocessor register 
the destination general register 

a symbolic expression 

a coprocessor register from which values are assigned 

a general register from which values are assigned 

the coprocessor specific operation 

a symbolic label 



destination 

dest-gpr 

address 

source 

src-gpr 

operation 

label 



( 



Table 5-15. Coprocessor Interface Operands 



Description 


Op-code 


Operand 


Load Word Coprocessor z 


Iwcz 


destination, address 


Store Word Coprocessor z 


swcz 


source, address 


Move From Coprocessor z 


mfcz 


dest-gpr, source 


Move To Coprocessor z 


mtcz 


src-gpr, destination 


Branch Coprocessor z False 


bczf 


label 


Branch Coprocessor z True 


bczt 




Coprocessor z Operation 


cz 


expression 


Control From Coprocessor z 


cfcz 


dest-gpr, source 


Control To Coprocessor z 


ctcz 


src-gpr, destination 



( 



Table 5-16. Coprocessor Interface Instruction Formats 
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5.6.2 Coprocessor Interface Instruction Descriptions 

Table 5-17 shows descriptions of coprocessor interface instructions. 



Instruction Name 



Description 



Branch Coprocessor z True 

(bczt) 



Branch Coprocessor z False 
(bczf) 



Control From Coprocessor z 

(cfcz) 



Control To Coprocessor z 

(ctcz) 



Coprocessor z Operation (cz) 



Branches to the specified label 
when the specified coprocessor as- 
serts a true condition. The z se- 
lects one of the coprocessors. A 
previous coprocessor operation sets 
the condition. 



Branches to the specified label 
when the specified coprocessor as- 
serts a false condition. The z se- 
lects one of the coprocessors. A 
previous coprocessor operation sets 
the condition. 



Stores the contents of the 
coprocessor control register speci- 
fied by the source in the general 
register specified by dest-gpr. 

Stores the contents of the general 
register specified by src-gpr in the 
coprocessor control register speci- 
fied by the destination. 

Executes a coprocessor-specific op- 
eration on the specified coproces- 
sor. The z selects one of four dis- 
tinct coprocessors. 



Table 5-17. Coprocessor Interface Instruction Descriptions 
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Instruction Nam© 



Description 



Load Word Coprocessor z 

(lwcz) 



Move From Coprocessor z 

(mfcz) 



Move To Coprocessor z (mtcz) 



Store Word Coprocessor z 

(swcz) 



Loads the destination with the con- 
tents of a word that is at the mem- 
ory location specified by the effec- 
tive address. The z selects one of 
four distinct coprocessors. Load 
Word Coprocessor replaces all 
register bytes with the contents of 
the loaded word. If bits and 1 
of the effective address are not 
zero, the machine signals an ad- 
dress exception. 

Stores the contents of the 
coprocessor register specified by 
the source in the general register 
specified by dest-gpr. 

Stores the contents of the general 
register specified by src-gpr in the 
coprocessor register specified by 
the destination. 



Stores the contents of the 
coprocessor register in the memory 
location specified by the effective 
address. The z selects one of four 
distinct coprocessors. If bits and 
1 of the effective address are not 
zero, the machine signals an ad- 
dress error exception. 



( 



c 
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6. Coprocessor Instruction Set 

Chapter 6 describes the coprocessor instructions for these coprocessors: 

• system control coprocessor (cpO) instructions 

• floating point coprocessor instructions 

See Chapter 5 for a description of the main processor's instructions and 
the coprocessor interface instructions. 

6.1 Instruction Notation 



The tables in this chapter list the assembler format for each coprocessor's 
load, store, computational, jump, branch, and special instructions. The 
format consists of an op-code and a list of operand formats. The tables 
list groups of closely related instructions; for those instructions, you can 
use any op-code with any specified operand. NOTE: The system control 
coprocessor instructions do not have operands. Operands can have any of 
these formats: 

• memory references— for example a relocatable symbol +/- an expres- 
sion (register) 

• expressions (for immediate values) 

• two or three operands— for example, add $3, $4 is the same as add 

$3, $3, $4 
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The following terms are used to discuss floating point operations: 

• infinite— A value of +00 or -00. 

• infinity— A symbolic entity that represents values with magnitudes 
greater than the largest value in that format. 

• ordered— The usual result from a comparison, namely: <,=, or > 

• NaN— Symbolic entities that represent values not otherwise available in 
floating point formats. There are two kinds of NaNs. Quiet NaNs 
represent unknown or uninitialized values. Signaling NaNs represent 
symbolic values and values that are too big or too precise for the for- 
mat. Signaling NaNs raise an invalid operation exception whenever an 
operation is attempted on them. 

• unordered— The condition that results from a floating-point compari- 
son when one or both operands are NaNs. 



( 



6.2 Floating Point Instructions 



Table 6-1 shows the classes of the floating point coprocessor instructions. 



( 



Instruction 



Description 



Load and Store Instructions 

Computational Instructions 

Relational Instructions 
Move Instructions 



Load values and move data be- 
tween memory and coprocessor 
registers. 

Do arithmetic and logical opera- 
tions on values in coprocessor reg- 
isters. 

Compare two floating point values. 

Move data between registers. 



Table 6-1. Floating Point Coprocessor Instructions 

A particular floating point instruction may be implemented in hardware, 
software, or a combination of hardware and software. 



( 
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6.2.1 Floating Point Formats 

The formats for the single and double precision floating point constants 
are shown below. 





189 31 (big-endian) 








1 


8 bits 


23 bits 








31 30 23 22 C 
Single Precision 

1 1112 


1 (little-endian) 


(big-endian) 
63 




1 


11 bits 


52 bits 






63 62 52 51 

Double Precision 


C 
(little— 


> 

endian) 



Figure 6-1. Floating Point Formats 



6.2.2 Floating Point Load and Store Formats 

Floating point load and store instructions must use even registers. Table 
6-2 shows floating point operands. Table 6-3 shows the floating point 
load and store formats. 
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Operand 


Meaning 


destination 


the destination register 


address 


offset (base) 


source 


the source register 



Table 6-2, Floating Point Load and Store Operands 



c 



Description 


Op-code 


Operand 


Load Fp 






Double 


Ld 


destination, address 


Single 


l.s 




Store Fp 






Double 


s.d 


source, address 


Single 


s.s 





Table 6-3. Floating Point Load and Store Formats 



( 



6.2.3 Floating Point Load and Store Descriptions 

Table 6-4 groups the instructions by function. Please consult Table 6-3 
for the op-codes. 



( 
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Instruction 



Description 



Load Fp Instructions 



Store Fp Instructions 



Load eight bytes for double preci- 
sion and four bytes for single preci- 
sion from the specified effective 
address into the destination regis- 
ter, which must be an even regis- 
ter. The bytes must be word 
aligned. NOTE: To ensure com- 
patibility with future machines, we 
recommend that you use double 
word alignment for double preci- 
sion operands. 

Stores eight bytes for double preci- 
sion and four bytes for single preci- 
sion from the source floating point 
register in the destination register, 
which must be an even register. 
NOTE: To ensure compatibility 
with future machines, we recom- 
mend that you use double word 
alignment for double precision op- 
erands* 



Table 6-4. Floating Point Load and Store Descriptors 



6.2.4 Floating Point Computational Formats 

This part of Chapter 6 describes floating point computational instructions. 
Table 6-5 shows floating point computational operands. Table 6-6 shows 
the formats of floating point computational instructions. 



Operand 



Meaning 



destination the destination register 

source the source register 

gpr general purpose register 

Table 6-5. Floating Point Computational Operands 
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Description 


Op-code 


Operand 


Absolute Value Fp 






Double 


abs.d 


destination, srcl 


Single 


abs.s 




Negate Fp 






Double 


neg.d 




Single 


neg.s 




Add Fp 






Double 


add.d 


destination, srcl, src2 


Single 


add.s 




Divide Fp 






Double 


div.d 




Single 


div.s 




Multiply Fp 






Double 


mul.d 




Single 


mul.s 




Subtract Fp 






Double 


sub.d 




Single 


sub.s 





c 



Table 6-6. Floating Point Computational Instruction Formats 



c 



( 
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Description 


Op-code 


Operand 


Convert Source to 
Specified Fp Precision 






Double to Single Fp 
Fixed Point to Single Fp 
Single to Double Fp 


cvt.s.d 
cvt.s.w 
cvt.d.s 


destination, srcl 


Fixed Point to Double Fp 
Single to Fixed Point Fp 
Double to Fixed Point Fp 


cvt.d.w 
cvt.w.s 
cvt.w.d 




Truncate and Round 
Operations 




destination, src, gpr 


Truncate to Single Fp 
Truncate to Double Fp 


trunc.w.s 
trunc.w.d 




Round to Single Fp 
Round to Double Fp 


round, w.s 
round, w.d 





Table 6-6. Floating Point Computational Instruction Formats 

(continued) 



6.2.5 Floating Point Computational Instruction Descriptions 

Table 6-7 groups the instructions by function. Please consult Table 6-6 
for the op-code names. 
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Instruction 



Description 



Absolute Value Fp Instructions Compute the absolute value of the 

contents of srcl and put the speci- 
fied precision floating point result 
in the destination register. 

Add Fp Single Instructions Add the contents of srcl (or the 

destination) to the contents of src2 
and put the result in the destina- 
tion register. When the sum of 
two operands with opposite signs is 
exactly zero, the sum has a positive 
sign for all rounding modes except 
round toward -co. For that 
rounding mode, the sum has a 
negative sign. 

Convert Source to Another Precision Fp Instructions 

Convert the contents of srcl to the 
specified precision, round accord- 
ing to the rounding mode, and put 
the result in the destination regis- 
ter. 



Truncate and Round Instructions 



The trunc instructions truncate the 
value in the source floating-point 
register and put the resulting inte- 
ger in the destination floating-point 
register, using the third (general- 
purpose) register to hold a tempo- 
rary value. (This is a macro-in- 
struction.) The round instructions 
work like trunc, but round the 
floating-point value to an integer 
instead of truncating it. 



( 



Table 6-7. Floating Point Computational Instruction Descriptions 



( 
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Instruction 



Description 



Divide Fp Instructions 



Multiply Fp Instructions 



Negate FP Instructions 



Subtract Fp Instructions 



Compute the quotient of two val- 
ues. These instructions treat srcl 
as the dividend and src2 as the di- 
visor. Divide Fp instructions di- 
vide the contents of srcl by the 
contents of src2 and put the result 
in the destination register. If the 
divisor is a zero, the machine sig- 
nals a error if the divide-by-zero 
exception is enabled. 

Multiplies the contents of srcl (or 
the destination) with the contents 
of src2 and puts the result in the 
destination register. 

Compute the negative value of the 
contents of srcl and put the speci- 
fied precision floating point result 
in the destination register. 

Subtract the contents of src2 from 
the contents of srcl (or the desti- 
nation). These instructions put the 
result in the destination register. 
When the difference of two oper- 
ands with the same signs is exactly 
zero, the difference has a positive 
sign for all rounding modes except 
round toward -oo. For that 
rounding mode, the sum has a 
negative sign. 



Table 6-7. Floating Point Computational Instruction Descriptions 

(continued) 
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6.3 Floating Point Relational Operations 

Table 6-8 summarizes the floating point relational instructions. The first 
column under Condition gives a mnemonic for the condition tested. As 
the "branch on true/false" condition can be used to logically negate any 
condition, the second column supplies a mnemonic for the logical negation 
of the condition in the first column. This provides a total of 32 possible 
conditions. The four columns under Relations give the result of the com- 
parison based on each condition. The final column states if an invalid 
operation is signaled for each condition. 

For example, with an equal condition (EQ mnemonic in the True col- 
umn) , the logical negation of the condition is not equal (NEQ) , and a 
comparison that is equal is True for equal and False for greater than, less 
than, and unordered, and no Invalid Operation Exception is given if the 
relation is unordered. 



Condition 


Relations 


Invalid 
Operation 

Exception if 
Unordered 


Mnemonic 


Greater 
Than 


Less 
Than 


Equal 


Unordered 


True 


False 


F 

UN 
EQ 
UEQ 
OLT 
ULT 
OLE 
ULE 


T 

OR 
NEQ 
OLG 
UGE 
OGE 
UGT 
OGT 


F 
F 
F 
F 
F 
F 
F 
F 


F 
F 
F 
F 
T 
T 
T 
T 


F 
F 
T 
T 
F 
F 
T 
T 


F 
T 
F 
T 
F 
T 
F 
T 


no 
no 
no 
no 
no 
no 
no 
no 


SF 

NGLE 

SEQ 

NGL 
LT 

NGE 
LE 

NGT 


ST 
GLE 

SNE 
GL 

NLT 
GE 

NLE 
GT 


F 
F 
F 
F 
F 
F 
F 
F 


F 
F 
F 
F 
T 
T 
T 
T 


F 
F 
T 
T 
F 
F 
T 
T 


F 
T 
F 
T 
F 
T 
F 
T 


yes 
yes 
yes 
yes 
yes 
yes 
yes 
yes 



( 



( 



Table 6-8. Floating Point Relational Operators 
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The mnemonics in Table 6-8 have the following meanings: 



F False 

UN Unordered 

EQ Equal 

UEQ Unordered or Equal 



OLT Ordered Less Than 



ULT 


Unordered or Less T 


OLE 


Ordered Less Than 




or Equal 


ULE 


Unordered or Less 




Than or Equal 


SF 


Signaling False 


NGLE 


Not Greater Than or 




Less Than or Equal 


SEQ 


Signaling Equal 


NGL 


Not Greater than or 


LT 


Less Than 


NGE 


Not Greater Than 




or Equal 


LE 


Less Than or Equal 


NGT 


Not Greater Than 



T 


True 


OR 


Ordered 


NEQ 


Not Equal 


OLG 


Ordered or 




Less Than 




or Greater 




Than 


UGE 


Unordered or 




Greater Than 




or Equal 


OGE 


Ordered 




Greater Than 


UGT 


Unordered or 




Greater Than 


OGT 


Ordered 




Greater Than 


ST 


Signaling True 


GLE 


Greater Than, 




or Less 




Than or Equal 


SNE 


Signaling Not 




Equal 


GL 


Greater Than 




or Less Than 


NLT 


Not Less Than 


GE 


Greater Than 




or Equal 


NLE 


Not Less Than 




or Equal 


GT 


Greater Than 



To branch on the result of a relational: 

/* branching on a compare result */ 



c.eq.s $fl,$f2 
belt true 
bclf false 



/* compare the single precision values */ 

/* if $fl equals $f2, branch to true */ 

/* if $fl does not equal $f2, branch to */ 

/* false */ 
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6.3.1 Floating Point Relational Formats 

In Table 6-9 srcl and src2 refer to the source registers. 



Description 


Op-code Operand 


Compare F 




Double 


c.f.d srcl,src2 


Single 


c.f.s 


Compare UN 




Double 


c.un.d 


Single 


c.un.s 


* Compare EQ 




Double 


c.eq.d 


Single 


c.eq.s 


Compare UEQ 




Double 


c.ueq.d 


Single 


c.ueq.s 


Compare OLT 




Double 


c.olt.d 


Single 


c.olt.s 


Compare ULT 




Double 


c.ult.d 


Single 


cults 


Compare OLE 




Double 


c.ole.d 


Single 


c.ole.s 


Compare ULE 




Double 


c.ule.d 


Single 


c.ule.s 


Compare SF 




Double 


c.sf.d 


Single 


c.sf.s 



( 



( 



* These are the most common Compare instructions. 
The machine provides other Compare instructions for 
IEEE compatibility. 

Table 6-9. Floating Point Relational Instruction Formats 



( 
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Description 


Op-code 


Operand 


Compare NGLE 






Double 


c.ngle.d 


srcl, src2 


Single 


c.ngle.s 




Compare SEQ 






Double 


c.seq.d 




Single 


c.seq.s 




Compare NGL 






Double 


c.ngl.d 




Single 


c.ngl.s 




* Compare LT 






Double 


c.lt.d 




Single 


c.lt.s 




Compare NGE 






Double 


c.nge.d 




Single 


c.nge.s 




* Compare LE 






Double 


c.le.d 




Single 


c.le.s 




Compare NGT 






Double 


c.ngt.d 




Single 


c.ngt.s 





* These are the most common Compare instructions. 
The machine provides other Compare instructions for 
IEEE compatibility. 

Tabie 6-9. Floating Point Relational Instruction Formats 
(continued) 



6.3.2 Floating Point Relational Instruction Descriptions 

This part of Chapter 6 describes the relational instruction descriptions by 
function. Refer to Chapter 1 for information regarding registers. Please 
consult Table 6-9 for the op-code names. 
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Instruction 



Description 



Compare EQ Instructions 



Compare F Instructions 



Compare LE Instructions 



Compare LT Instructions 



Compare NGE Instructions 



Compare the contents of srcl with 
the contents of src2. If srcl 
equals src2 a true condition results; 
otherwise, a false condition results. 
The machine does not signal an 
exception for unordered values. 

Compare the contents of srcl with 
the contents of src2. These in- 
structions always produce a false 
condition. The machine does not 
signal an exception for unordered 
values. 

Compare the contents of srcl with 
the contents of src2. If srcl is less 
than or equal to src2, a true condi- 
tion results; otherwise, a false con- 
dition results. The machine signals 
an exception for unordered values. 

Compare the contents of srcl with 
the contents of src2. If srcl is less 
than src2, a true condition results; 
otherwise, a false condition results. 
The machine signals an exception 
for unordered values. 

Compare the contents of srcl with 
the contents of src2. If srcl is less 
than src2 (or the contents are un- 
ordered), a true condition results; 
otherwise, a false condition results. 
The machine signals an exception 
for unordered values. 



( 



Table 6-10. Floating Point Relational Instruction Descriptions 



c 
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Instruction 



Description 



Compare NGL Instructions 



Compare NGLE Instructions 



Compare NGT Instructions 



Compare the contents of srcl with 
the contents of src2. If srcl 
equals src2 or the contents are un- 
ordered, a true condition results; 
otherwise, a false condition results. 
The machine signals an exception 
for unordered values. 

Compare the contents of srcl with 
the contents of src2. If srcl is un- 
ordered, a true condition results; 
otherwise, a false condition results. 
The machine signals an exception 
for unordered values. 

Compare the contents of srcl with 
the contents of src2. If srcl is less 
than or equal to src2 or the con- 
tents are unordered, a true condi- 
tion results; otherwise, a false con- 
dition results. The machine signals 
an exception for unordered values. 



Compare OLE Instructions 



Compare OLT Instructions 



Compare the contents of srcl with 
the contents of src2. If srcl is less 
than or equal to src2, a true condi- 
tion results; otherwise, a false con- 
dition results. The machine does 
not signal an exception for unor- 
dered values. 

Compare the contents of srcl with 
the contents of src2. If srcl is less 
than src2, a true condition results; 
otherwise, a false condition results. 
The machine does not signal an 
exception for unordered values. 



Table 6-10. Floating Point Relational Instruction Descriptions 

(continued) 
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Instruction 



Description 



Compare SEQ Instructions 



Compare SF Instructions 



Compare ULE Instructions 



Compare UEQ Instructions 



Compare ULT Instructions 



Compare the contents of srcl with 
the contents of src2. If srcl 
equals src2, a true condition re- 
sults; otherwise, a false condition 
results. The machine signals an 
exception for unordered values. 

Compare the contents of srcl with 
the contents of src2. This always 
produces a false condition. The 
machine signals an exception for 
unordered values. 

Compare the contents of srcl with 
the contents of src2. If srcl is less 
than or equal to src2 (or srcl is 
unordered), a true condition re- 
sults; otherwise, a false condition 
results. The machine does not sig- 
nal an exception for unordered 
values. 

Compare the contents of srcl with 
the contents of src2. If srcl 
equals src2 (or srcl and src2 are 
unordered), a true condition re- 
sults; otherwise, a false condition 
results. The machine does not sig- 
nal an exception for unordered 
values. 

Compare the contents of srcl with 
the contents of src2. If srcl is less 
than src2 (or the contents are un- 
ordered), a true condition results; 
otherwise, a false condition results. 
The machine does not signal an 
exception for unordered values. 



Table 6-10. Floating Point Relational Instruction Descriptions 

(continued) 



( 



( 



c 
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Instruction 



Description 



Compare UN Instructions 



Compare the contents of srcl with 
the contents of src2. If either srcl 
or src2 is unordered, a true condi- 
tion results; otherwise, a false con- 
dition results. The machine does 
not signal an exception for unor- 
dered values. 



Table 6-10. Floating Point Relational Instruction Descriptions 

(continued) 



6.3.3 Floating Point Move Formats 

The floating point coprocessor's move instructions move data from source 
to destination registers (only floating point registers are allowed) . 



Description 



Op-code 



Operand 



Move Fp 

Double 

Single 



mov.d 
mov.s 



destination, srcl 



Table 6-11. Floating Point Move Instruction Formats 
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6.3.4 Floating Point Move instruction Descriptions 

This part of Chapter 6 describes the floating point move instructions. 
Please consult Table 6-11 for the op-code names. 



Instruction Description 

Move Fp Instructions Move the double or single preci- 

sion contents of srcl to the desti- 
nation register, maintaining the 
specified precision. 



Table 6-12. Floating Point Move Instruction Descriptions 



6.4 System Control Coprocessor Instructions 

The system control coprocessor (cpO) handles all functions and special 
and privileged registers for the virtual memory and exception handling sub- 
systems. The system control coprocessor translates addresses from a large £ 
virtual address space into the machine's physical memory space. The V. 

coprocessor uses a translation lookaside buffer (TLB) to translate virtual 
addresses to physical addresses. 

Primarily, only people who need to change the operating system need 
these instructions. 



( 
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6.4.1 System Control Coprocessor Formats 

These coprocessor system control instructions do not have operands: 



Description 



Op-cod© Operand 



Translation Lookaside Buffer Probe tlbp 

Translation Lookaside Buffer Read tlbr 

Translation Lookaside Buffer Write Random tlbwr 

Translation Lookaside Write Index tlbwi 

Table 6-13. System Control Instruction Formats 



6.4.2 System Control Coprocessor Instruction Descriptions 

This part of Chapter 6 describes the system control coprocessor instruc- 
tions. 



Instruction 



Description 



Translation Lookaside Buffer Probe 

(tibp) 



Translation Lookaside Buffer Read 

(tlbr) 



Probes the translation lookaside 
buffer (TLB) to see if the TLB has 
an entry that matches the contents 
of the EntryHi register. If a 
match occurs, the machine loads 
the Index register with the number 
of the entry that matches the 
EntryHi register. If no TLB entry 
matches, the machine sets the 
high-order bit of the Index regis- 
ter. 



Loads the EntryHi and EntryLo 
registers with the contents of the 
translation lookaside buffer (TLB) 
entry specified in the TLB Index 
register. 



Table 6-14. System Control Coprocessor Instruction Descriptions 
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Instruction Description ___ 

Translation Lookaside Buffer Write Random 

(tlbwr) Loads the specified translation 

lookaside buffer (TLB) entry with g- 

the contents of the EntryHi and 1 

EntryLo registers. The contents of 
the TLB Random register specify 
the TLB entry to be loaded. 

Translation Lookaside Buffer Write Index 

(tlbwi) Loads the specified translation 

lookaside buffer (TLB) entry with 
the contents of the EntryHi and 
EntryLo registers. The contents of 
the TLB Index register specify the 
TLB entry to be loaded. 



Table 6-14. System Control Coprocessor Instruction Descriptions 

(continued) 



6.5 Control and Status Register 

Floating-point coprocessor control register 31 contains status and control 
information. It controls the arithmetic rounding mode and the enabling of 
user-level traps, and indicates exceptions that occurred in the most re- 
cently executed instruction, and any exceptions that may have occurred 
without being trapped. 



( 



( 
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Figure 6-2. Floating Point Control and Status Register 31 



The exception bits are set for instructions that cause an IEEE standard 
exception or an optional exception used to emulate some of the more 
hardware-intensive features of the IEEE standard. 

The exception field is loaded as a side-effect of each floating-point opera- 
tion (excluding loads, stores, and unformatted moves). The exceptions 
which were caused by the immediately previous floating-point operation 
can be determined by reading the exception field. 

The meaning of each bit in the exception field is given in Table 6-15. If 
two exceptions occur together on one instruction, the field will contain the 
inclusive OR of the bits for each exception. 
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Exception 
Field Bit 


Description 


E 


Unimplemented Operation 


V 


Invalid Operation 


z 


Division by Zero 


I 


Inexact Exception 





Overflow Exception 


u 


Underflow Exception 



Table 6-15. Exception Field Bit Descriptions 

The unimplemented operation exception is normally invisible to user-level 
code. It is provided to maintain IEEE compatibility for non-standard im- 
plementations. 

The five IEEE standard exceptions are listed below: 



Field 


Description 


V 


Invalid Operation 


z 


Division by Zero 


I 


Inexact Exception 


o 


Overflow Exception 


u 


Underflow Exception 



Table 6-16. IEEE Standard Exceptions 

Each of the five exceptions is associated with a trap under user control, which 
is enabled by setting one of the five bits of the enable field, shown above. 

When an exception occurs, both the corresponding exception and status 
bits are set. If the corresponding enable flag bit is set, a trap is taken. In 
some cases the result of an operation is different if a trap is enabled. 
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( 



( 



C 



The status flags are never cleared as a side effect of floating-point operations, 
but may be set or cleared by writing a new value into the status register, using 
a "move to coprocessor control" instruction. 

The floating-point compare instruction places the condition which was de- 
tected into the "c" bit of the control and status register, so that the state of 
the condition line may be saved and restored. The "c" bit is set if the condi- 
tion is true, and cleared if the condition is false, and is affected only by 
compare and move to control register instructions. 



6.5.1 Exception Trap Processing 

For each IEEE standard exception, a status flag is provided that is set on 
any occurrence of the corresponding exception condition with no corre- 
sponding exception trap signaled. It may be reset by writing a new value 
into the status register. The flags may be saved and restored individually, 
or as a group, by software. When no exception trap is signaled, a default 
action is taken by the floating-point coprocessor, which provides a substi- 
tute value for the original, exceptional, result of the floating-point opera- 
tion. The default action taken depends on the type of exception, and in 
the case of the Overflow exception, the current rounding mode. 



Invalid operation exception 

The invalid operation exception is signaled if one or both of the operands 
are invalid for an implemented operation. The result, when the exception 
occurs without a trap, is a quiet NaN when the destination has a floating- 
point format, and is indeterminate if the result has a fixed-point format. 
The invalid operations are 

1. Addition or subtraction: magnitude subtraction of infinities, such as 
( + oo ) - ( - oo ) 

2. Multiplication: times oo, with any signs 

3. Division: over or oo over oo, with any signs 

4. Square root: rx, where x is less than zero 

5. Conversion of a floating-point number to a fixed-point format when 
an overflow, or operand value of infinity or NaN, precludes a faithful 
representation in that format 
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6. Comparison of predicates involving < or > without ?, when the oper- 
ands are "unordered" 

7. Any operation on a signaling NaN. 

Software may simulate this exception for other operations that are invalid 
for the given source operands. Examples of these operations include 
IEEE-specified functions implemented in software, such as Remainder: x 
REM y, where y is zero or x is infinite; conversion of a floating-point 
number to a decimal format whose value causes and overflow or is infinity 
of NaN; and trancendental functions, such as In (-5) or cos ~~\3). 



Division-by-zero exception 

The division by zero exception is signaled on an implemented divide opera- 
tion if the divisor is zero and the dividend is a finite nonzero number. The 
result, when no trap occurs, is a correctly signed infinity. 

If division by zero traps are enabled, the result register is not modified, and 
the source registers are preserved. 

Software may simulate this exception for other operations that produce a ^ 
signed infinity, such as ln(0), sec(ir/2), csc(O) or . 1 

Overflow exception 

The overflow exception is signaled when what would have been the magni- 
tude of the rounded floating-point result, were the exponent range un- 
bounded, is larger than the destination format's largest finite number. 
The result, when no trap occurs, is determined by the rounding mode and 
the sign of the intermediate result. 

If overflow traps are enabled, the result register is not modified, and the 
source registers are preserved. 



Underflow exception 

Two related events contribute to underflow. One is the creation of a tiny 

non-zero result between + or - 2 E min (minimum expressable exponent) 
which, because it is tiny, may cause some other exception later. The 
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other is extraordinary loss of accuracy during the approximation of such 
tiny numbers by denormalized numbers. 

The IEEE standard permits a choice in how these events are detected, but 
requires that they must be detected the same way for all operations. 

The IEEE standard specifies that "tininess" may be detected either: "after 
rounding" (when a nonzero result computed as though the exponent range 

were unbounded would lie strictly between + or - 2 E min^ or "before 
rounding" (when a nonzero result computed as though the exponent range 
and the precision were unbounded would lie strictly between + or - 

2 E min. The architecture requires that tininess be detected after round- 
ing. 

Loss of accuracy may be detected as either "denormalization loss" (when 
the delivered result differs from what would have been computed if the 
exponent range were unbounded), or "inexact result" (when the delivered 
result differs from what would have been computed if the exponent range 
and precision were both unbounded) . The architecture requires that loss 
of accuracy be detected as inexact result. 

When an underflow trap is not enabled, underflow is signaled (via the un- 
derflow flag) only when both tininess and loss of accuracy have been de- 
tected. The delivered result might be zero, denormalized, or + or - 

2 E min. When an underflow trap is enabled, underflow is signaled when 
tininess is detected regardless of loss of accuracy. 

If underflow traps are enabled, the result register is not modified, and the 
source registers are preserved. 



Inexact exception 

If the rounded result of an operation is not exact or if it overflows without 
an overflow trap, then the inexact exception is signaled. The rounded or 
overflowed result is delivered to the destination register, when no inexact 
trap occurs. If inexact exception traps are enabled, the result register is 
not modified, and the source registers are preserved. 



Unimplemented operation exception 

If an operation is specified that the hardware may not perform, due to an 
implementation restriction on the supported operations or supported for- 
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mats, an unimplemented operation exception may be signaled, which al- 
ways causes a trap, for which there are no corresponding enable or flag 
bits. The trap cannot be disabled. 

This exception is raised at the execution of the unimplemented instruction. 
The instruction may be emulated in software, possibly using implemented 
floating-point unit instructions to accomplish the emulation. Normal in- 
struction execution may then be restarted. 

This exception is also raised when an attempt is made to execute an in- 
struction with an operation code or format code which has been reserved 
for future architectural definition. The unimplemented instruction trap is 
not optional, since the current definition contains codes of this kind. 

This exception may be signaled when unusual operands or result condi- 
tions are detected, for which the implemented hardware cannot properly 
handle the condition. These may include (but are not limited to), denor- 
malized operands or results, NaN operands, trapped overflow or underflow 
conditions. The use of this exception for such conditions is optional. 



6.5.2 Floating Point Rounding 

Bits and 1 of the coprocessor control register 31 sets the rounding mode 
for floating point. The machine allows four rounding modes: 

• Round to nearest rounds the result to the nearest representable value. 
When the two nearest representable values are equally near, this mode 
rounds to the value with the least significant bit zero. To select this 
mode, set bits 1..0 of control register 31 to 0. 

• Round toward zero rounds toward zero. It rounds to the value that 
is closest to and not greater in magnitude than the infinitely precise 
result. To select this mode, set bits 1..0 of control register 31 to 1. 

• Round toward positive infinity rounds to the value that is closest to 
and not less than the infinitely precise result. To select this mode, set 
bits 1..0 of control register 31 to 2. 

• Round toward negative infinity rounds toward negative infinity. It 
rounds to the value that is closest to and not greater than the infinitely 
precise result. To select this mode, set bits L.O of control register 31 
to 3. 



( 



( 
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To set the rounding mode: 

/* setting the rounding mode */ 

RoundNearest = OxO 

RoundZero = 0x1 

RoundPosInf = 0x2 

RoundNeglnf = 0x3 

cfcl rt2, $31 # move from coprocessor 1 

and rt, Oxfffffffc # zero the round mode bits 
or rt , RoundZero # set mask as round to zero 

ctcl rt , $f31 # move to coprocessor 1 
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c 



( 



7. Linkage Conventions 



This chapter gives rules and examples to follow when designing an assem- 
bly language program. The chapter concludes with a "learn by doing" 
technique that you can use if you still have any doubts about how a par- 
ticular calling sequence should work. This involves writing a skeleton ver- 
sion of your prospective assembly routine using a high level language, and 
then compiling it with the -S option to generate a human-readable assem- 
bly language file. The assembly language file can then be used as the 
starting point for coding your routine. 



7.1 Introduction 



When you write assembly language routines, you should follow the same 
calling conventions that the compilers observe, for two reasons: 

• Often your code must interact with compiler-generated code, accepting 
and returning arguments or accessing shared global data. 

• The symbolic debugger gives better assistance in debugging programs 
using standard calling conventions. 
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The conventions for the compiler system are a bit more complicated than 
some, mostly to enhance the speed of each procedure call. Specifically: 

• The compilers use the full, general calling sequence only when neces- 
sary; where possible, they omit unneeded portions of it. For exam- 
ple, the compilers don't use a register as a frame pointer whenever 
possible. 

• The compilers and debugger observe certain implicit rules rather than 
communicating via instructions or data at execution time. For exam- 
ple, the debugger looks at information placed in the symbol table by a 
".frame" directive at compilation time, so that it can tolerate the lack 
of a register containing a frame pointer at execution time. 



7.2 Program Design 

This section describes three general areas of concern to the assembly lan- 
guage programmer: 

• usable and restricted registers 

• stack frame requirements on entering and exiting a routine 

• the "shape" of data (scalars, arrays, records, sets) laid out by the vari- 
ous high level languages 



7.2.1 Register Use and Linkage 

The main processor has 32 32-bit integer registers. The uses and restric- 
tions of these registers are described in Table 7-1. 

The floating point coprocessor has 16 floating point registers. Each regis- 
ter can hold either a single precision (32 bit) or a double precision (64 
bit) value. All references to these registers uses an even register number 
(e.g., $f4). Refer to Table 7-2. 



( 



( 



( 
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software name 

register name ,, . . . . use and linkage 

* (from regdef.h) * 

$0 always has the value 

$at reserved for the assembler 

$2.. $3 vO-vl used for expression evaluations and to 

hold the integer type functions results. 
Also used to pass the static link when 
calling nested procedures. 

$4.. $7 a0-a3 used to pass the first 4 words of integer 

type actual arguments, whose values are 
not preserved across procedure calls 

$8.. $15 t0-t7 temporary registers, used for expression 

evaluations, whose values are not preserved 
across procedure calls. 

$16.. $23 s0-s7 saved registers, whose values must be 

preserved across procedure calls. 

$24.. $25 t8-t9 temporary registers, used for expression 

evaluations, whose values are not pre- 
served across procedure calls. 

reserved for the operating system kernel 

contains the global pointer 

contains the stack pointer 

contains the frame pointer (if needed) 

contains the return address; used for 
expression evaluation. 

Table 7-1. Integer Registers 



$kt0..$ktl 


kO-kl 


$28 or $gp 


gP 


$29 or $sp 


sp 


$30 or $fp 


fp 


$31 


ra 
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register name use and linkage 



$f0..f3 used to hold floating point type function 

results ($f0) and complex type function 
results ($f0 has the real part, $f2 has 
the imaginary part) 

$f4..fl0 temporary registers, used for expression 

evaluation, whose values are not pre- 
served across procedure calls. 

$fl2..$fl4 used to pass the first 2 single or double 

precision actual arguments, whose values 
are not preserved across procedure calls. 

$f!6..$fl8 temporary registers, used for expression 

evaluations, whose values are not pre- 
served across procedure calls. 

$f20..$f30 saved registers, whose values must be 

preserved across procedure calls. 

Table 7-2. Floating Point Registers 



7.2.2 The Stack Frame 

The compilers classify each routine into one of of the following categories: 

• no n- leaf routines, that is, routines that call other procedures 

• leaf routines, that is, routines that do not themselves execute any pro- 
cedure calls. Leaf routines are of two types: 

o leaf routines that require stack storage for local variables 

o leaf routines that do not require stack storage for local variables. 

You must decide the routine category before determining the calling se- 
quence. 
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C 



( 



( 



To write a program with proper stack frame usage and debugging capabili- 
ties, use the following procedure: 

1. Regardless of the type of routine, you should include a .ent pseudo- 
op and an entry label for the procedure. The .ent pseudo-op is for 
use by the debugger, and the entry label is the procedure name. The 
syntax is: 

.ent procedure_name 
procedure_name : 

2. If you are writing a leaf procedure that does not use the stack, skip to 
step 3. For leaf procedure that uses the stack or non-leaf procedures, 
you must allocate all the stack space that the routine requires. The 
syntax to adjust the stack size is: 

subu $sp, framesize 

where framesize is the size of frame required. Space must be allo- 
cated for: 

• local variables 

• saved general registers. Space should be allocated only for those 
registers saved. For non-leaf procedures, you must save $31, 
which is used in the calls to other procedures from this routine. If 
you use registers $16-$23, you must also save them. 

• saved floating point registers. Space should be allocated only for 
those registers saved. If you use registers $f20-$f30 you must also 
save them. 

• Procedure call argument area. You must allocate the maximum 
number of bytes for arguments of any procedure that you call 
from this routine. 

NOTE: Once you have modified $sp, you should not modify it again 
for the rest of the routine. 

3. Now include a .frame pseudo-op: 

frame framereg, framesize, returnreg 

The virtual frame pointer is a frame pointer as used in other compiler 
systems but has no register allocated for it. It consists of the framereg 
($sp, in most cases) added to the framesize (see step 2 above). Fig- 
ure 7-1 illustrates the stack components. 
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high memory 



virtual frame 
pointer ($fp) 



frame 
offset 



I 



stack 
pointer ($sp) 

(framereg) 



low memory 



argument n 



argument 1 



local & temporaries 



saved registers 

(including retumreg ) 



argument build 



/ framesize 



( 



Figure 7-1. Stack Organization 

The returnreg specifies the register the return address is in (usually 
$31). These usual values may change if you use a varying stack point- 
er or are specifying a kernel trap routine. 

If the procedure is a leaf procedure that does not use the stack, skip 
to step 7. Otherwise you must save the registers you allocated space 
for in step 2. 

To save the general registers, use the following operations: 

.mask bitmask, frameoffset 

sw reg, framesize+frameoffset-N($sp) 



( 
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The .mask directive specifies the registers to be stored and where they 
are stored. A bit should be on in bitmask for each register saved (for 
example, if register $31 is saved, bit 31 should be '1' in bitmask. Bits 
are set in bitmask in little-endian order, even if the machine configu- 
ration is big-endian) . The frameoffset is the offset from the virtual 
frame pointer (this number is usually negative) . N should be for the 
highest numbered register saved and then incremented by four for 
each subsequently lower numbered register saved. For example: 

sw $31 , frames ize+frameof f set ($sp) 

sw $17 , framesize+frameoffset-4($sp) 

sw $16 , f ramesize+f rameof f set-8 ($sp) 

Figure 7-2 illustrates this example. 



high memory 



virtual frame 
pointer ($fp) 



frame 
offset 



I 



stack 
pointer ($sp) 



saved $31 



saved $17 



saved $16 



low memory 



► framesize 



Figure 7-2. Stack Example 
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Now save any floating point registers that you allocated space for in 
step 2 as follows: 

. f mask bitmask, frameoffset 

s.[sd] reg, framesize+frameoffset-N($sp) 

Notice that saving floating point registers is identical to saving general 
registers except we use the .fmask pseudo-op instead of .mask, and 
the stores are of floating point singles or doubles. The discussion re- 
garding saving general registers applies here as well, but remember that 
N should be incremented by 8 for doubles. 

5. This step describes parameter passing: how to access arguments passed 
into your routine and passing arguments correctly to other procedures. 

As specified in step 2, space must be allocated on the stack for all 
arguments even though they may be passed in registers. This provides 
a saving area if their registers are needed for other variables. 

General registers $4-$7 and float registers $fl2, $fl4 must be used for 
passing the first four arguments (if possible) . You must allocate a pair 
of registers (even if it's a single precision argument) that start with an 
even register for floating point arguments appearing in registers. 

In the table below, the W arguments are considered single and dou- 
ble precision floating point arguments, and 'nN' arguments are every- 
thing else. The elipses (...) mean that the rest of the arguments do 
not go in registers regardless of their type. The 'stack* assignment 
means that you do not put this argument in a register. The register 
assignments occur in the order shown in order to satisfy optimizing 
compiler protocols. 



( 



( 



Arguments 



Register Assignments 



(fl, f2, ...) 
(fl, nl, f2, ...) 
(fl, nl, n2, ...) 
(nl, n2, n3, n4, 
(nl, n2, n3, fl, 
(nl, n2, fl, ...) 
(nl, fl, ...) 



fl -> $fl2, f2 -> $fl4 

fl -> $fl2, nl -> $6, f2 -> stack 

fl -> $fl2, nl -> $6, n2 -> $7 

nl -> $4, n2 -> $5, n3 -> $6, n4 -> $7 

nl -> $4, n2 -> $5, n3 -> $6, fl -> stack 

nl -> $4, n2 -> $5, fl -> ($6, $6) 

nl -> $4, fl -> ($6, $7) 



( 



Table 7-3. Arguments 
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6. Next, you must restore registers that were saved in step 4. To restore 
general purpose registers: 

lw reg, framesize+frameof fset-N($sp) 

To restore the floating point registers: 

1. [sd] reg, framesize+frameoffset-N($sp) 

(Refer to step 4 for a discussion of the value of N.) 

7. Get the return address: 

lw $31, framesize+frameof f set ($sp) 

8. Clean up the stack: 

addu $sp, framesize 

9. Return: 

j $31 

10. To end the procedure: 

. end procedurename 



7.2.3 The Shape of Data 

In most cases, high-level language routine and assembly routines commu- 
nicate via simple variables: pointers, integers, booleans, and single- and 
double-precision real numbers. Describing the details of the various high- 
level data structures (arrays, records, sets, and so on) is beyond our scope 
here. If you need to access such a structure as an argument or as a 
shared global variable, refer to the "Learn by Doing" technique described 
at the end of this section. 



7.3 Examples 

This section contains the examples that illustrate program design rules; 
each example shows a procedure written and C and its equivalent written 
in assembly language. 
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Figure 7-3 shows a non-leaf procedure. Notice that it creates a stack- 
frame, and also saves its return address since it must put a new return 
address into register $31 when it invokes its callee: 



float 

nonleaf(i, j) 
int i, *j ; 

{ 

double atof () ; 

int temp; 

temp = i - * j ; 

if (i < *j) temp = -temp; 

return atof (temp) ; 



} 



nonleaf: 



## Create stackframe 

## Save the return address 



( 



.globl nonleaf 
float 

nonleaf (i, j) 
int i, *j ; 

{ 
.ent nonleaf 2 

subu $sp, 24 
sw $31, 20($sp) 
.mask 0x80000000, -4 
.frame $sp, 24, $31 

double atof () ; 

int temp; 

temp = i - * 3 ; 
lw $2, 0($5) 
subu $3, $4, $2 

if (i < *j) temp = -temp; 
bge $4, $2, $32 ## Note: $32 is a label, not a register 
negu $3, $3 



## Return value goes in $f0 
## Restore return address 
## Delete stackframe 
## Return to caller 



( 



## Arguments are in $4 and $5 



32: 








# 10 


reti 


jrn 


atof (temp) ; 




move 




$4, $3 




jal 




atof 




cvt . s 


.d 


$fo, $fo 




lw 




$31, 20($sp) 




addu 




$Sp, 24 




3 




$31 




.end 




nonleaf 



( 



Figure 7-3. Non-Leaf Procedure 
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Figure 7-4 shows a leaf procedure that does not require stack space for 
local variables. Notice that it creates no stackframe, and saves no return 
address: 



mt 






leaf (pi, 


P2) 




int pi 


P2 




{ 






return 


(Pi 


> p2) ? pi : p2; 


} 




.globl leaf 


# 1 




int 


# 2 




leaf(pl, p2) 


# 3 




int pi, p2; 


# 4 




{ 
.ent leaf 2 


leaf: 




.frame $sp, 0, $31 


# 5 




return (pi > p2) ? pi 
ble $4, $5, $32 
move $3 , $4 
b $33 



P2; 



## Arguments in $4 and $5 





b 




$33 




$32: 


move 




$3, $5 




$33: 












move 




$2, $3 


## Return value goes in $2 




3 




#31 


## Return to caller 


# 6 


} 
.end 


leaf 







Figure 7-4. Leaf Procedure Without Stack Space for Local Variables 

Figure 7-5 shows a leaf procedure that requires stack space for local vari- 
ables. Notice that it creates a stack frame, but does not save a return 
address. 



Version 1.0 



Linkage Conventions 7-11 



char 

leaf_storage(i) 
int i ; 

{ 

char a[16] ; 

int j ; 



for (j = 

a[j] = 
for (j = 10; 

a[j] = 'a' 
return a[i] ; 
} 



0; 3 < 10; 
'0' + j ; 

3 < 16; 

+ J; 



j++) 
j++) 



# 

# 
# 

leaf 



$32: 
# 



. globl leaf_jstorage 

1 char 

2 leaf_storage(i) 

3 int i ; 

4 { ## "2" is the lexical level of the 
. ent leaf_storage 2 

_storage: ## procedure. You may omit it. 

subu $ s p, 24 ## Create stackframe 
.frame $sp, 24, $31 

5 char a[16] ; 

6 int 3 ; 
7 

8 for (j = 0; j < 10; j.++) 
sw $0, 4($sp) 
addu $3, $sp, 24 

9 



10 



$33: 
# 



11 



12 



a 


[j] 


= '0 


' + j; 


Iw 




$14, 


4($sp) 


addu 




$15, 


$14, 48 


addu 




$24, 


$3, $14 


sb 




$15, 


-16($24) 


Iw 




$25, 


4($sp) 


addu 




$8, 


$25, 1 


sw 




$8, 


4($sp) 


bit 




$8, 


10, $32 


for 


(j 


= 10 


; 3 < 16; j+4 


li 




$9, 


10 


sw 




$9, 


4($sp) 


a 


[J] 


= 'a 


' ■+ 31 


lw 




$10, 


4($sp) 


addu 




$11, 


$10, 97 


addu 




$12, 


$3, $10 


sb 




$11, 


-16($12) 


lw 




$13 , 


4($sp) 


addu 




$14, 


$13, 1 


sw 




$14, 


4($sp) 


bit 




$14, 


16, $33 


return 


a[i] 


; 


addu 




$15, 


$3, $4 


ibu 




$2, 


-16($15) 


addu 




$sp, 


24 


3 




$31 




.end 






leaf_storage 



## Argument is in $4 

## Return value goes in $2 

## Delete stackframe 

## Return to caller 



( 



( 



Figure 7-5. Leaf Procedure With Stack Space for Local Variables 
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7-4 Learning by Doing 

The rules and parameter requirements required between assembly language 
and other languages are varied and complex. The simplest approach to 
coding an interface between an assembly routine and a routine written in a 
high-level language is to do the following: 

• Use the high-level language to write a skeletal version of the routine 
that you plan to code in assembly language. 

• Compile the program using the — S option, which creates an assembly 
language (.s) version of the compiled source file. 

• Study the assembly-language listing and then, imitating the rules and 
conventions used by the compiler, write your assembly language code. 

The next two sections illustrate techniques to use in creating an interface 
between assembly language and high-level language routines. The exam- 
ples shown are merely to illustrate what to look for in creating your inter- 
face. Details such as register numbers will vary according to the number, 
order, and data types of the arguments. You should write and compile 
realistic examples of your own code in writing your particular interface. 



7.4,1 Calling a High-Level Language Routine 

The following steps show a technique to follow in writing an assembly lan- 
guage routine that calls atof, a routine written in C that converts ASCII 
characters to numbers; for more information, see the atof (3) in the 
IRIS-4D Programmer's Reference Manual. 

1. Write a C program that calls atof. Pass global rather than local vari- 
ables; this makes them to recognize in the assembly language version 
of the C program, (and ensures that optimization doesn't remove any 
of the code on the grounds that it has no effect.) 

Figure 7-6 is an example of a C program that calls atof. 
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char &- LL= "3.1 415 

double d, atof o ; — _ _ ___^ c is declared as a 
fanl o ^-^ Olob.l variable. 

{ 

d = atof (c* 

f = (float) atof (c) ; 

} 




Figure 7-6. C Program that Calls atof 

2. Compile the program using the using the compiler options shown be- 
low: 

cc -S -O caller. c 

The —S option causes the compiler to produce the assembly-language 
listing; the — O option, though not required, reduces the amount of 
code generated, making the listing easier to read. 

3. After compilation, look at the file caller. s (shown below). The high- 
lighted section of the listing shows how the parameters are passed, the 
execution of the call, and how the returned values are retrieved. 



( 



( 
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la $4, c ## load address of c 








jal atof ## call atof 








s.d $f0, d f = (float) atof(c); 




# 


7 


## store result in d 
la $4, c ## load address of c 
jal atof ## call atof 








cvt.s.d $f4, $f0 ## convert double result to float 
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, end caller . . 





Figure 7-7. Compilation Listing 
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7.4.2 Calling an Assembly Language Routine 

This section shows a technique to follow in writing an assembly language 
routine that calls a routine written in a high-level language (Pascal is used 
in this example) . 

1. Write a facsimile of the assembly language routine you wish to call. In 
the body of the routine, write statements that use the same arguments 
you intend to use in the final assembly language routine. Copy the 
arguments to global variables rather than local variables to make it 
easy for you to read the resulting assembly language listing. 

Figure 7-8 shows the Pascal facsimile of the assembly language pro- 
gram. 



type 




str = packed array [1 . . 10] of char; 




subr = 2 . . 5 ; 




var 




global__r: real; 




global_c: subr; 




global_s: str; 




global_b: boolean; 




function callee(var r: real; c: subr; s: str): 


boolean; 


begin 




global_r := r; 




global_c := c; 




global_s := s; 




callee := c = 3; 




end ; 





Figure 7-8. Pascal Facsimile 

2. Compile the program using the using the compiler options shown be- 
low: 

cc -S -0 caller. c 

The — S option causes the compiler to produce the assembly-language 
listing; the —O option, though not required, reduces the amount of 
code generated, making the listing easier to read. 

3. After compilation, look at the file caller.s (shown below). The high- 
lighted section of the listing shows how the parameters are passed, the 
execution of the call, and how the returned values are retrieved. 
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c a lie e s;;!: : ; : / - .;• • • '•;. • . . . '•.>; 

.. v."-:f r ame : .' x: $ s'p';; : : : i;Q i • • '$ 3 i . ?\: 

. ; : :: . : ,' S W y • ; ••;:•' . $5'v, • 4 ( $ S p ) : > 

;'.'•; 'sw/ ••,:•;' :/-$6.p' 8($sp;> ;•' : - 



lbu 



$3, 4($Sp) 



## Get subrange c, masking it to 8 bits 



-arid' "'••' .;• • •• ' $ 3 ;v $ 3' % : ;25 5 • 
;il ••:;::. ; : :;begirv. ; 
•12' ,' : - ./ijgiobal^r : =.-ri': 



l.s 



$f4, 0($4) 



## The pointer to "r" is in 0($4) 



;•";:. .->.-; s";.;j :••;;.;:'•; $f 4v : - global_r 
•• : # • •••• i : 3. '••' : : : : : global^c ' ;•■% • c ; • 

: : :'.-.X:-. J-,,.- .", : -:$3 ^-global^V.:. 

• \S O . •: •;'•' 

:''#;'• : 14 : " '...: :- : "r~ glbbal^s- ,.: =• s; : - •; 



$32: 



la 


$14, 


global_s 


addu 


$15, 


$sp, 8 


. set 


noat 


addu 


$24, 


$15, 10 


lbu 


$1, 


0($15) 


addu 


$15, 


$15, 2 


sb 


$1, 


0($14) 


lbu 


$1, 


-1($15) 


addu 


$14 


$14, 2 


sb 


$1, 


-1($14) 



## For array "s", the caller gives you a 
## pointer at 8($sp). If you want to use 
## it as a call-by-value argument just as 
## Pascal does (that is, if you want to 
## be able to modify a local copy without 
## affecting the global copy) then you 
## must copy it into your stack frame as 
## shown here (the code enclosed in ".set 
## noat" is a tight byte-copying loop) . 
## Otherwise, you may simply use the 
## pointer provided to you. 



; : ; : bri;e);V •; : ;/;''$l : 5; . :#24/, • $32 

••'. '•:•• ■ : - : '^fe : ' : .": ^ : : :' ''.'*&& '.' 

•I £j • ";. .;•'•;■ •'; ' • .ical j ee • : = ':•'' c • ' =' 3 :j 

" i; ' -andi;/:;. If; $5 : •• :■ $5 ', ', : 2$ 5 ' - ' . 



and 



$2, $5, 255 



## Return the boolean by leaving it in $2 



:','AV 

'. .end. 



Wm 



Figure 7-9. Compilation Listing 
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7.5 Memory Allocation 

The machine's default memory allocation scheme gives every process two 
storage areas, that can grow without bound. A process exceeds virtual 
storage only when the sum of the two areas exceeds virtual storage space. 
The link editor and assembler use the scheme shown in Figure 7-10; an 
explanation of each area in the alocation scheme follows the figure. 



Oxffffffff 
OxSfffffff 


Reserved for Kernel 
(accessible from Kernel Mode) 
(2GB) 


0x7fffffff 
0x7ffff000 


Not Accessible 
(by convention, not a hardware 
implementation) 
(4KB) 


0x7fffefff 


Activation Stack 
(grows toward zero) 


$sp ""•*' 


Protected 
(grows from either edge) 




Heap 
(grows up) 




.bss 




.sbss 


$gp ->* 


.sdata 




.data 


0x10000000 


.rdata 


Oxfffffff 


Reserved for 
Shared Libraries 




Not Used 


0x400000 


Program .text 
(including header) 


0x3fffff 


Reserved 
(4MB) 



n 

m 

m 
m 



( 



m 



m 



m 



( 



Figure 7-10. Memory Layout (User Program View) 
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[T| Reserved for kernel operations. 

[21 Reserved for operating system use. 

[31 Used for local data in C programs. 

[41 Not allocated until a user requests it, as in System V shared mem- 
ory regions. 

[51 The heap is reserved for sbrk and break system calls, and it not 
always present. 

[6l The machine divides all data into one of five sections: 



bss —Uninitialized data with a size greater than the value 
specified by the -G command line option. 

sbss —Data less than or equal to the -G command line op- 
tion. (512 is the default value for the -G option.) 

sdata (small data)— Data initialized and specified for the sdata 
section. 

data (data)— Data initialized and specified for the data sec- 
tion. 

rdata (read-only data)— Data initialized and specified for the 
rdata section. 



Reserved for any shared libraries. 
Contains the .text section 
Reserved. 
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8. Pseudo Op-Codes 



The keywords in Chapter 8 describe pseudo op-codes (directives). These 
pseudo op-codes influence the assembler's later behavior. In the text, 
boldface type specifies a keyword and italics represents an operand that 
you define. 

The assembler has the pseudo op-codes shown in Table 8-1. 



Pseudo-Op 



Description 



.align expression 



Advance the location counter to make the 
expression low order bits of the counter 
zero. 



Normally, the .half, .word, .float, and 
.double directives automatically align their 
data appropriately. For example, .word 
does an implicit .align 2 (.double does a 
.align 3). You disable the automatic 
alignment feature with .align 0. The as- 
sembler reinstates automatic alignment at 
the next .text, .data, .rdata, or .sdata 
directive. 

Labels immediately preceding an automatic 
or explicit alignment are also realigned. 
For example, foo: .align 3; .word is the 
same as .align 3; foo: .wordO . 



Table 8-1 . Pseudo Op-Codes 
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Pseudo-Op 



Description 



.ascii string [, string]. 



asciiz string [, string].. 



Assembles each string from the list into 
successive locations. The .ascii directive 
does not null pad the string. You MUST 
put quotation marks (") around each 
string. You can use the backslash escape 
characters. For a list of the backslash 
characters, see Chapter 4. 

Assembles each string in the list into suc- 
cessive locations and adds a null. You can 
use the backslash escape characters. For a 
list of the backslash characters, see Chap- 
ter 4. 



c 



.asmO Tells the assembler's second pass that this 

assembly came from the first pass. (For 
use by compilers.) 

.bgnb symno (For use by compilers.) Sets the beginning 

of a language block. The .bgnb and 
.endb directives delimit the scope of a 
variable set. The scope can be an entire 
procedure, or it can be a nested scope (for 
example a "{}" block in the C language). 
The symbol number symno refers to a 
dense number in a .T file. To set the end 
of a language block, see .endb. 

.byte expression! [., expression! ]... 

[, expressionN] Truncates the expressions from the 

comma-separated list to 8 -bit values, and 
assembles the values in successive loca- 
tions. The expressions must be absolute. 
The operands can optionally have the 
form: expression! [ : expression! ]. The 
expression! replicates expression! 's value 
expression! times. 



Table 8-1. Pseudo Op-Codes (continued) 
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Pseudo-Op 



Description 



.comm name, expression 



Unless defined elsewhere, name becomes a 
global common symbol at the head of a 
block of expression bytes of storage. The 
linker overlays like-named common 
blocks, using the maximum of the expres- 
sions. 



.data 



Tells the assembler to add all subsequent 
data to the data section. 



.double expression [ , expression! ] ... 

/, expressionN Initializes memory to 64-bit floating point 

numbers. The operands can optionally 
have the form: expression! [ : expression! 
] . The expression! is the floating point 
value. The optional expression! is a non- 
negative expression that specifies a repeti- 
tion count. The expression! replicates ex- 
pression! 's value expression! times. This 
directive automatically aligns its data and 
any preceding labels to a double-word 
boundary. You can disable this feature by 
using .align 0. 



.end [procjiame] 



.endb symno 



.endr 



.ent procjiame 



Sets the end of a procedure. Use this di- 
rective when you want to generate informa- 
tion for the debugger. To set the begin- 
ning of a procedure, see .ent. 

Sets the end of a language block. To set 
the beginning of a language block, see 
.bgnb. 

Signals the end of a repeat block. To start 
a repeat block, see .repeat. 

Sets the beginning of the procedure 
procjiame. Use this directive when you 
want to generate information for the 
debugger. To set the end of a procedure, 
see .end. 
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Pseudo-Op 



Description 



.extern name expression 



.err 



.file filejnumber 
file jtamejst ring 



name is a global undefined symbol whose 
size is assumed to be expression bytes. 
The advantage of using this directive, in- 
stead of permitting an undefined symbol to 
become global by default, is that the as- 
sembler can decide whether to use the 
economical $gp-relative addressing mode, 
depending on the value of the -G option. 
As a special case, if expression is zero, the 
assembler refrains from using $gp to ad- 
dress this symbol regardless of the size 
specified by -G. 

Signals an error. Any compiler front-end 
that detects an error condition puts this 
directive in the input stream. When the 
assembler encounters a .err, it quietly 
ceases to assemble the source file. This 
prevents the assembler from continuing to 
process a program that is incorrect. (For 
use by compilers.) 



Specifies the source file corresponding to 
the assembly instructions that follow. (For 
use by compilers.) 



( 



.float expression! [ , expression! ]... [, expressionN ] 

Initializes memory to single precision 
32-bit floating point numbers. The oper- 
ands can optionally have the form: expres- 
sion! [ : expression! ] . The optional ex- 
pression! is a non-negative expression that 
specifies a repetition count. This optional 
form replicates expressionVs value expres- 
sion! times. This directive automatically 
aligns its data and preceding labels to a 
word boundary. You can disable this fea- 
ture by using .align 0. 

Table 8-1 . Pseudo Op-Codes (continued) 
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Pseudo-Op 



Description 



.fmask mask offset 



.frame frame-register offset 
return_pc_jegister 



Sets a mask with a bit turned on for each 
floating point register that the current rou- 
tine saved. The least-significant bit corre- 
sponds to register $f0. The offset is the 
distance in bytes from the virtual frame 
pointer at which the floating point registers 
are saved. The assembler saves higher 
register numbers closer to the virtual frame 
pointer. You must use .ent before .fmask 
and only one .fmask may be used per 
.ent. Space should be allocated for those 
registers specified in the .fmask. 



Describes a stack frame. The first register 
is the frame-register, the offset is the dis- 
tance from the frame register to the virtual 
frame pointer, and the second register is 
the return program counter (or, if the first 
register is $0, this directive shows that the 
return program counter is saved four bytes 
from the virtual frame pointer) . You must 
use .ent before .frame and only one 
.frame may be used per .ent. No stack 
traces can be done in the debugger without 
.frame. 



.globl name 



Makes the name external. If the name is 
otherwise defined (by its appearance as a 
label), the assembler will export the sym- 
bol; otherwise it will import the symbol. 
In general, the assembler imports unde- 
fined symbols (that is, it gives them the 
UNIX storage class "global undefined" and 
requires the linker to resolve them) . 
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Pseudo-Op 



Description 



.half expression! [ 



expression! ] ... [ , expressionN] 

Truncates the expressions in the comma- 
separated list to 16-bit values and assem- 
bles the values in successive locations. 
The expressions must be absolute. This 
directive can optionally have the form: ex- 
pression! [ : expression! ]. The expres- 
sion! replicates expressionVs value expres- 
sion! times. This directive automatically 
aligns its data appropriately. You can dis- 
able this feature by using .align 0. 



( 



.lab label name 



Associates a named label with the current 
location in the program text. (For use by 
compilers) . 



.lcomm name, expression 



Makes the name's data type bss. The as- 
sembler allocates the named symbol to the 
bss area, and the expression defines the 
named symbol's length. If a .globl direc- 
tive also specifies the name, the assembler 
allocates the named symbol to external 
bss. The assembler puts bss symbols in 
one of two bss areas. If the defined size 
is smaller than the size specified by the 
assembler or compiler's -G command line 
option, the assembler puts the symbols in 
the sbss area and uses $gp to address the 
data. 



( 
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Pseudo-Op 



Description 



.loc filejiumber 
line number 



Specifies the source file and the line within 
that file that corresponds to the assembly 
instructions that follow. The assembler 
ignores the file number when this directive 
appears in the assembly source file. Then, 
the assembler assumes that the directive 
refers to the most recent .file directive. 
When a .loc directive appears in the bi- 
nary assembly language .G file, the file 
number is a dense number pointing at a 
file symbol in the symbol table .T file. 
For more information about .G and .T 
files, see the IRIS-4D Series Compiler 
Guide. 



.mask mask offset 



nop 



Sets a mask with a bit turned on for each 
general purpose register that the current 
routine saved. Bit one corresponds to reg- 
ister $1. The offset is the distance in 
bytes from the virtual frame pointer where 
the registers are saved. The assembler 
saves higher register numbers closer to the 
the virtual frame pointer. Space should be 
allocated for those registers appearing in 
the mask. If bit zero is set it is assumed 
that space is allocated for all 31 registers 
regardless of whether they appear in the 
mask. (For use by compilers) . 

Tells the assembler to put in an instruction 
that has no effect on the machine state. 
While several instructions cause no-opera- 
tion, the assembler only considers the ones 
generated by the nop directive to be wait 
instructions. This directive puts an explicit 
delay in the instruction stream. 
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Pseudo-Op 



Description 



.option options 



Tells the assembler that certain options 
were in effect during compilation. (These 
options can, for example, limit the assem- 
bler's freedom to perform branch optimiza- 
tions.) This option is intended for com- 
piler-generated .s files rather than for 
hand-coded ones. 



( 



.repeat expression 



Repeats all instructions or data between the 
.repeat directive and the .endr directive. 
The expression defines how many times 
the data repeats. With the .repeat direc- 
tive, you CANNOT use labels, branch in- 
structions, or values that require relocation 
in the block. To end a .repeat, see 
.endr. 



.rdata 



Tells the assembler to add subsequent data 
into the rdata section. 



.sdata 



Tells the assembler to add subsequent data 
to the sdata section. 



( 



Table 8-1. Pseudo Op-Codes (continued) 
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Pseudo-Op Description 



.set option Instructs the assembler to enable or to dis- 

able certain options. Use set options only 
for hand-crafted assembly routines. The 
assembler has these default options: reor- 
der, macro, and at. You can specify only 
one option for each .set directive. You 
can specify these .set options: 



• The reorder option lets the assembler 
reorder machine language instructions 
to improve performance. 

• The noreorder option prevents the as- 
sembler from reordering machine lan- 
guage instructions. If a machine lan- 
guage instruction violates the hardware 
pipeline constraints, the assembler is- 
sues a warning message. 

• The macro option lets the assembler 
generate multiple machine instructions 
from a single assembler instruction. 

• The nomacro option causes the assem- 
bler to print a warning whenever an 
assembler operation generates more 
than one machine language instruction. 
You must select the noreorder option 
before using the nomacro option; oth- 
erwise, an error results. 

• The at option lets the assembler use 
the $at register for macros, but gener- 
ates warnings if the source program 
uses $at. 
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Pseudo-Op 



Description 



• When you use the noat option and an 
assembler operation requires the $at 
register, the assembler issues a warning 
message; however, the noat option 
does let source programs use $at with- 
out issuing warnings. 

• The nomove options tells the assem- 
bler to mark each subsequent instruc- 
tion so that it cannot be moved during 
reorganization. Because the assembler 
can still insert nop instructions where 
necessary for pipeline constraints, this 
option is less stringent than noreorder. 
The assembler can still move instruc- 
tions from below the nomove region to 
fill delay slots above the region or vice 
versa. The nomove option has part of 
the effect of the "volatile" C declara- 
tion; it prevents otherwise independent 
loads or stores from occurring in a dif- 
ferent order than intended. 

• The move option cancels the effect of 
nomove. 



( 



( 



.space expression 



.struct expression 



Advances the location counter by the value 
of the specified expression bytes. The as- 
sembler fills the space with zeros. 

This permits you to lay out a structure us- 
ing labels plus directives like .word, .byte, 
and so forth. It ends at the next segment 
directive (.data, .text, etc.). It does not 
emit any code or data, but defines the la- 
bels within it to have values which are the 
sum of expression plus their offsets from 
the .struct itself. 



( 
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Pseudo-Op 



Description 



(symbolic equate) 



Takes one of these forms: name = expres- 
sion or name = register. You must define 
the name only once in the assembly, and 
you CANNOT redefine the name. The 
expression must be computable when you 
assemble the program, and the expression 
must involve operators, constants, and 
equated symbols. You can use the name 
as a constant in any later statement. 

Tells the assembler to add subsequent code 
to the text section. (This is the default.) 

Specifies the major and minor version 
numbers (for example, version 0.15 would 
be .verstamp 15). 

(For use by compilers) . Describes a regis- 
ter variable by giving the offset from the 
virtual frame pointer and the symbol num- 
ber symno (the dense number) of the sur- 
rounding procedure. 

.word expression! [, expression! ] ... / , expressionN] 

Truncates the expressions in the comma- 
separated list to 32-bits and assembles the 
values in successive locations. The expres- 
sions must be absolute. The operands can 
optionally have the form: expression! [ : 
expression! ]. The expression! replicates 
expressionVs value expression! times. 
This directive automatically aligns its data 
and preceding labels to a word boundary. 
You can disable this feature by using 
.align 0. 



.text 



.verstamp major minor 



.vreg register offset symno 
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9. Object File Format 



This chapter provides information on the object file format and has the 
following major topics: 

• An overview of the components that make up the object file, and the 
differences between the object-file format and the UNIX System V 
common object file format (COFF) . 

• A description of the headers and sections of the object file. Detailed 
information is given on the logic followed by the assembler and link 
editor in handling relocation entries. 

• The format of the three types of object files (OMAGIC, NMAGIC, 
and ZMAGIC), and information used by the system loader in loading 
object files at run- time. 

• Archive files and link editor defined symbols. 



9.1 Overview 



The assembler and the link editor generate object files in the order shown 
in Figure 9-1. Any areas empty of data are omitted, except that the Op- 
tional Header is always present. 



Version 1.0 Object File Format 9-1 



The fields of the Symbol table portion (indicated in Figure 9-1) that ap- 
pear in the final object file format vary, as follows: 

• The Line Numbers, Optimization Symbols, and Auxiliary Symbols 
subtables appear only when debugging is on (when the user specifies 
one of the compiler — gl, — g2 or — g3 options ). 

• When the user specifies the — x option (strip non-globals) for the link 
edit phase, the link editor strips the Local Symbols, Optimization 
Symbols, Auxiliary Symbols subtables from the object file, and updates 
the Procedure Descriptor table. 

• The link editor strips the entire Symbol table from the object file 
when the user specifies the — s option (strip) for the link edit phase. 

Any new assembler or link editor designed to work with the compiler sys- 
tem should lay out the object file in the order shown in Figure 9-1. The 
link editor can process object files that are ordered differently, but per- 
formance may be degraded. 



( 
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Created only if 
debugging is on 



Missing if stripped of 
non-globals 




File Header 



Optional Headers 



Section Headers 



Section Data 



text 
read-only data 

large data 

small data 
small bss (0 size) 
large bss (0 size) 



Section Relocation 
Information 



text 
read-only data 
large data 
small data 



Symbolic Header 



Une Numbers- 



Procedure Descriptor Table 



||SI||llyri|oi81 



I : ^ptirnfzatioh "■ Sympl^ ; : 



v#^ 



l|||||lpng|:i 



External Strings 



File Descriptor 



Relative File; pesbrip^; 



External Symbols 



Symbol table. 
Missing if fully 
stripped 



Figure 9-1. Object Files 
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Readers already familiar with standard UNIX System V COFF (common 
object file format) may be interested in the difference between it and the 
compiler system format, as described next. 

The compiler system File Header definition is based on UNIX System V 
header file filehdr.h with the following modifications. 

• The symbol table file pointer and the number of symbol table entries 
now specify the file pointer and the size of the Symbolic Header 
respectively (described in Chapter 10). 

• All tables that specify symbolic information have their file pointers and 
number of entries in this Symbolic Header. 

The Optional Header definition has the same format as the UNIX System 
V header file aouthdr.h (the standard UNIX system a.outheader) except 
the following fields have been added: bssjstart, gprmask, cprmask, and 
gp_yalue. See Table 9-4. 

The Section Header definition has the same format as the UNIX System 
V's header file scnhdr.h. except the line number fields (sjnnoptr and 
sjilnno) aren't used. See Table 9-6. 

The relocation information definition is similar to Berkeley UNIX, which 
has ' 'local' ' relocation types; however, you should read the topic Section 
Relocation Information in this chapter to be aware of differences that do 
exist. 



9.2 The File Header 



Table 9-1 shows the format of the File Header; the header file filehdr.h 
contains its definition. 



( 
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Declaration 



Field 



Description 



unsigned short 

unsigned short 

long 

long 

long 

unsigned short 

unsigned short 



f__magic; magic number 

fjiscns; number of section 

f_timdat; time and date stamp 

f__symptr; file pointer to symbolic header 

f__nsyms; size of symbolic header 

f_opthdr; size of optional header 

f_flags; flags 



Table 9-1. File Header Format 



The fjsymptr points to the Symbolic Header of the Symbol table, and the 
fjisyms gives the size of the header. For a description of the Symbolic 
Header, see Chapter 10. Other fields in the File Header are described in 
the sections that follow. 



9.2.1 File Headier Magic Field (fjnagic) 

The magic number in the fjnagic entry in the File Header specifies the 
target machine on which an object file can execute. Table 9-2 shows the 
values and mnemonics for the magic numbers; the header file filehdr.h 
contain the preprocessor macro definitions. 
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Symbol 



Value 



Description 



MIPSEBMAGIC 0x0160 



MIPSELMAGIC 0x0 1 62 



SMIPSEBMAGIC 0x6001 



SMIPSELMAGIC 0x6201 



big-endian target (headers and tables 
have same byte sex as host machine. 

little-endian target (headers and tables 
have same byte sex as host machine.) 

big-endian target (headers and tables 
have opposite byte sex as host machine.) 

little-endian target (headers and tables 
have opposite byte sex as host machine) 



Table 9-2. File Header Magic Numbers 



9.2.2 Flags (f Jlags) 

The f Jlags field describes the object file characteristics. Table 9-3 de- 
scribes the flags and gives their hexadecimal bit patterns. The table notes 
those flags that don't apply to compiler system object files. 



( 
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Symbol 



Value Description 



F_RELFLG 0x0001 

F_EXEC 0x0002 

FJLNNO 0x0004 

F_LSYMS 0x0008 

FJMINMAL 0x0010 

F_UPDATE 0x0020 

FJSWABD 0x0040 

F_AR16WR 0x0080 

F_AR32WR 0x0100 

F AR32W 0x0200 



F_J>ATCH 
F NODF 



0x0400 
0x0400 



relocation information stripped from file 

file is executable (i.e. no unresolved 
external references) 

line numbers stripped from file 

local symbols stripped from file 

1 minimal object file (".m") output of f extract 

1 fully bound update file, output of ogen 

1 file whose bytes were swabbed (in names) 

1 file has the byte ordering of an AR16WR 
(e.g. 11/70) machine (it was created there, 
or was produced by conv) 

file has the byte ordering of an AR32WR 
machine (e.g. vax) 

1 file has the byte ordering of an AR32W machine 
(e.g. 3b,maxi,MC68000) 

1 file contains "patch" list in Optional Header 

1 (minimal file only) no decision functions for 
replaced functions. 



1 Not used by compiler system object modules. 



Table 9-3. File Header Flags 
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9-3 Optional Header 



The link editor and the assembler fill in the Optional Header, and the 
system (kernel) loader (or other program that loads the object module at 
run-time) uses the information it contains, as described in the section 
Loading Object Files in this chapter. 



( 



Table 9-4 shows the format of the Optional Header; the header file 
aouthdr.h contains its definition. 



Symbol 


Value 


Description 


short 


magic; 


See Table 9.5. 


short 


vstamp; 


version stamp 


long 


tsize; 


text size in bytes, padded to double-word 
boundary 


long 


dsize; 


initialized data in bytes, padded to 
double-word boundary 


long 


bsize; 


uninitialized data in bytes, padded to 
double-word boundary 


long 


entry; 


entry point 


long 


text_start; 


base of text used for this file 


long 


data_start; 


base of data used for this file 


long 


bss_start; 


base of bss used for this file 


long 


gprmask; 


general purpose register mask 


long 


cprmask[4]; 


co-processor register masks 


long 


gp_value; 


the gp value used for this object 



( 



Table 9-4. Optional Header Definition 



( 



The next section describes the magic field in the Optional Header. 
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9.3.1 Optional Header Magic Field (magic) 

Table 9-5 shows the values of the magic field for the Optional Header; 
the header file aouthdr.h contains the preprocessor macro definitions. 



Symbol Value 



Description 



OMAGIC 0407 



NMAGIC 0410 



ZMAGIC 0413 



Impure Format. The text is not write-protected or 
sharable; the data segment is contiguous with the 
text segment. 

Shared Text. The data segment starts at the next 
page following the text segment and the text seg- 
ment is write-protected. 

Demand Paged. The object file is to be demand 
loaded (each page is copied into memory directly 
from load module on the first reference to that 
page), the file has a special format, and the text 
and data segments start at fixed addresses. The 
text segment is write-protected (the default). 



Table 9-5. UNIX Magic Numbers 



See the Object Files section in this chapter for information on the format 
of OMAGIC, NMAGIC, and ZMAGIC files. 



9.4 Section Headers 



Table 9-6 shows the format of the Section Header; the header file 
scnhdr.h. contains the definition. 
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Declaration 



Field 



Description 



char 


s__name[8]; 


long 


sjpaddr; 


long 


s vaddr; 


long 


sjsize; 


long 


sjscnptr; 


long 


s___relptr; 


long 


s_Jnnoptr; 


unsigned short 


s_nreloc; 


unsigned short 


s_nlnno; 


long 


s_flags; 



section name 

physical address 

virtual address 

section size 

file pointer to raw data for section 

file pointer to relocation 

file pointer to line numbers (not used) 

number of relocation entries 

number of line number entries (not used) 

flags 



( 



Table 9-6. Section Header Format 

The sections that follow describe in detail some of the entries in the Sec- 
tion Header. 



9.4.1 Section Name (sjiame) 

Table 9-7 shows the constants for section names that can appear in the 
s_jiame field of the Section Header; the header file scnhdr.h contains the 
preprocessor macro definitions. 



( 



Declaration 


Field 


Description 


TEXT 


".text" 


text section 


RDATA 


".rdata" 


read only data section 


DATA 


".data" 


large data section 


SDATA 


".sdata" 


small data section 


BSS 


".bss" 


large bss section 


SBSS 


".sbss" 


small bss section 



Table 9-7 B Section Header Constants for Section Names 



( 
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9.4.2 Line Number Entries (sjnnoptr and sjilnno) 

The link editor fills the line number fields with zeros. The compiler and 
link editor don't use the line number entries in the Section Header, be- 
cause line number information appears only once in the object file, not 
once per section. The compiler instead places line numbers in the Line 
Numbers subtables of the Symbol table, as described in Chapter 10. 



9.4.3 Flags (sJFIags) 

Table 9-8 shows the flags that appear in s_Jlags and notes those flags that 
apply to compiler system object files; the header file scnhdr.h contains 
their definition. 



Symbol 



Value Description 



STYP REG 


0x00 


STYP DSECT 


0x01 


STYP NOLOAD 


0x02 


STYP GROUP 


0x04 


STYP J> AD 


0x08 


STYP COPY 


0x10 



STYP TEXT 


0x20 


STYP RDATA 


0x100 


STYP DATA 


0x40 


STYP SDATA 


0x200 


STYP SBSS 


0x80 


STYP BSS 


0x400 



regular section; allocated, relocated, loaded 

Mummy; not allocated, relocated, not loaded 

1 noload; allocated, relocated, not loaded 

1 grouped; formed of input sections 

1 padding; not allocated, not relocated, 

loaded 

1 copy; for decision function used by field 

update; not allocated, not relocated, 

loaded; relocated, and line number entries 

processed normally 

text only 

read only data only 

data only 

small data only 

contains small bss only 

bss only 



1 Not used by compiler system object modules. 



Table 9-8. Format of s_flags Section Header Entry 



Version 1.0 



Object File Format 9-11 



NOTE: For performance reasons, the link editor uses the sjlags entry 
instead of $_name to determine the type of section. However, the link 
editor does correctly fill in the sjiame entry. 



9.5 Section Data 



A section represents the smallest portion of the object file; compiler 
systme files are represented in six sections: .text, .rdata (read-only data), 
.data (data) 1 .sdata (small data), .sbss (small block started by storage), 
and .bss (block started by storage). The .text section contains the ma- 
chine instructions that are to be executed; the .rdata, .data, and .sdata 
contain initialized data; and the .sbss, and .bss sections reserve space for 
uninitialized data that is created by the kernel loader for the program be- 
fore execution and filled with zeros. 

Figure 9-2 shows the layout of the six sections. 



.text 



.rdata 



.data 



.sdata 



.sbss 



.bss 



text segment 



data segment 



bss segment 



( 



Figure 9-2. Organization of Section Data 



As noted in Figure 9-2, the sections are grouped into the text segment 
(containing the .text section), the data segment (.rdata, .data, and 
s.data) and the bss segment (.sbss and .bss). A section is described in 
and referenced through the Section Header, and segments through the 
Optional Header. 



( 
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The link editor references the data shown in Figure 9-2 both as sections 
and segments, through the Section Header and Optional Header respec- 
tively. However, the system (kernel) loader, when loading the object file 
at run-time, references the same data only by segment, through the Op- 
tional Header. 



9.6 Section Relocation Information 

This portion of the chapter is divided into the following parts: 

• The format of a relocation table entry and an explanation of its fields. 

• The logic followed by the assembler and the link editor is creating and 
updating an entry. 



9.6.1 Relocation Table Entry 

Table 9-9 shows the format of an entry in the Relocation Table; the 
header file reloc.h contains the definition. 

Declaration Field Description 

long r_vaddr; (virtual) address of an item to be relocated, 

unsigned rjsymndx:24, index into external symbols or section 

number; see r_extern below. 
r__reserved:3, 

retype: 4, relocation type 

r__extern:l; = 1 for an external relocation entry; 

r__symndx is an index into External Symbols. 

= for a local, relocation entry; r_symndx 

is the number of the section containing the 

symbol. 



Table 9-9. Format of a Relocation Table Entry 
The sections that follow describe some of the fields shown in Table 9-9. 
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Symbol Index (r_symndx) and Extern Field (r_extern) 

For external relocation entries, rjextern is set to 1 and r_symnndx is the 
index into External Symbols for this entry. In this case, the value of the 
symbol is used as the value for relocation. 

For local relocation entries, rjextern is set to 0, and rjsymndx contains a 
constant that refers to a section. In this case, the starting address of the 
section to which the constant refers is used as the value for relocation. 

Table 9-10 gives the section numbers for rjsyrnndx; the reloc.h file con- 
tains their preprocessor macro definitions. 



( 



Symbol 


Value 


Description 


R SN TEXT 


1 


.text section 


R SN RDATA 


2 


.rdata section 


R SN DATA 


3 


.data section 


R SN SDATA 


4 


.sdata section 


R SN SBSS 


5 


.sbss section 


RJSN_BSS 


6 


.bss section 



Table 9-10. Section Numbers for Local Relocation Entries 



Relocation Type (rjtype) 

Table 9-11 shows valid symbolic entries for the relocation type (rjype) 
field; the header file reloc.h contains their preprocessor macro defini- 
tions. 



( 



( 
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Symbol 



Value Description 



R_ABS 


0x0 


R_REFHALF 


0x1 


R_REFWORD 


0x2 


R_JMPADDR 


0x3 


RJREFHI 


0x4 


RJREFLO 


0x5 


R GPREL 


0x6 



relocation already performed. 

16-bit reference to the symbol's virtual address 

32-bit reference to the symbol's virtual address 

26-bit jump reference to the symbol's virtual 
address 

reference to the high 16-bits of symbol's 
virtual address 

reference to the low 16-bits of symbol's 
virtual address 

reference to the offset from the global pointer 
to the symbol's virtual address 



Table 9-11. Relocation Types 



9.6.2 Assembler and Link Editor Processing 

Compiler system executable object modules with all external references 
defined, have the same format as relocatable modules and are executable 
without re-link editing. 

Local relocation entries must be used for symbols that are defined. 
Therefore, external relocations are used only for undefined symbols. Fig- 
ure 9-3 gives an overview of the Relocation Table entry for an undefined 
external symbol. 
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Relocation Table Entry 










r vaddr 


txienn Dvrnpois 












r_symndx 


















value=0 


r extem=1 
















Section Data 




Sign-extended to 32 bits 


iitei 








pp 


| constant 



















( 



Figure 9-3. Relocation Table Entry for Undefined External Symbols 

The assembler creates this entry as follows: 

• Sets r_jaddr to point to the item to be relocated. 

• Places a constant to be added to the value for relocation at the 
address for the item to be relocated (r_yaddr) . 

• If the width of the constant is less than 32 bits, sign-extends the 
constant to 32 bits. 

• Adds the value for relocation (the value of the symbol) to the 
constant and places it back in the address to be relocated. 

• Sets rjsymndx to the index of the External Symbols entry that 
contains the symbol value. 

• Sets rjype to the constant for the type of relocation types. Table 
9-11 shows the valid constants for the relocation type. 

• Sets r_extern to 1. 

NOTE: The assembler always sets the value of the undefined entry in 
External Symbols to 0. It may assign a constant value to be added to the 
relocated value at the address where the location is to be done. If the 



( 



( 
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width of the constant is less than a full word, and an overflow occurs after 
relocation, the link editor flags this as an error. 

When the link editor determines that an external symbol is defined, it 
changes the Relocation Table entry for the symbol to a local relocation 
entry. Figure 9-4 gives an overview of the new entry. 



Relocation Table Entry 



r vaddr 



rsymndx 



retype 



r extern=0 



Section Data 



constant 



Sign-extended to 32 bits 



Section n Optional 
^ Header 











s vaddr 




















^ 


Section n Data 












symbol location 











Figure 9-4. Relocation Table Entry for a Local Relocation Entry 



To change this entry from an external relocation entry to a local reloca- 
tion entry, the link editor: 

• Picks up the constant from the address to be relocated (r_yaddr) . 

• If the width of the constant is less than 32 bits, sign-extends the 
constant to 32 bits. 

• Adds the value for relocation (the value of the symbol) to the 
constant and places it back in the address to be relocated. 

• Sets rjsymndx to the section number that contains the external 
symbol. 

• Sets r extern to 0. 



Version 1.0 



Object File Format 9-17 



Examples 

The examples that follow use external relocation entries. 

Example 1: 32-Bit Reference— R^REFWORD. This example shows 
assembly statements that set the value at location b to the global data 
value y. 

. globl y 

.data 
b: .word y #R_REFWORD relocation type at address b for symbol y 

In processing this statement, the assembler generates a relocation entry of 
type R_REFWORD for the address b and the symbol y\. After determining 
the address for the symbol y, the loader adds the 32-bit address of y to 
the 32-bit value at location b, and places the sum in location b. The 
loader handles 16-bit addresses (R_REFHALF) in the same manner, ex- 
cept it checks for overflow after determining the relocation value. 

Example 2: 26-Bit Jump— RJMPADDR. This example shows assem- 
bly statements that call routine x from location c. 

.text 
x: ;routine x 

c: jal x #R_JMPADDR relocation type at address c for symbol x 

In processing these statements, the assembler generates a relocation entry 
of type R_JMPADDR for the address c and the symbol jc. After deter- 
mining the address for the routine, the loader shifts the address right two 
bits, adds the low 26 bits of the result to the low 26 bits of the instruction 
at address c, and places the results back into the low 26 bits at address c. 

R__JMPADDR relocation entries are produced for the assembler's j (Jump) 
and jal (Jump and Link) instructions. These instructions take the high 
four bits of the target address from the address of their instruction. The 
link editor makes sure that the same four bits are in the target address 
after relocation; if not, it generates an error message. 

If the entries are local relocation types, the target of the Jump instruction 
is assembled in the instruction at the address to be relocated. The high 
four bits of the jump target are taken from the high 4 bits of the address 
of the instruction to be relocated. 



( 



( 
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Example 3: High/Low Reference-R_REFHI/R_REFLO. This example 
shows an assembler macro that loads the absolute address y, plus a con- 
stant, into Register 6: 

lw $r6,constant+y 

In processing this statement, the assembler generates a as the value y, 
and the following machine language statements: 

f: lui $at ,constant»16 #R_REFHI relocation type at 

address f for symbol y 

g: addiu $r6 ,constant&Oxf f f f ($at) #R_REFLO relocation type at 

address g for symbol y 

In this example, the assembler produces two relocation entries. 

NOTE: When a RJREFHI relocation entry appears, the next relocation 
entry must always be the corresponding RJREFLO entry. This is required 
in order to reconstruct the constant that is to be added to the value for 
relocation. 

In determining the final constant values for the two instructions, the as- 
sembler must take into account that the addiu instruction of the 
R_REFLO relocation entry, sign-extends the immediate value of the con- 
stant. 

In determining the sum of the address for the symbol y and the constant, 
the assembler does the following: 

• It uses the low 16 bits of this sum for the immediate value of the 
R_REFLO relocation address. 

• Because all instructions that are marked with a RJREFLO perform a 
signed operation, the assembler adjusts the high portion of the sum if 
Bit 15 is set. Then it uses the high 16 bits of the sum for the 
immediate value of the R_REFHI instruction at the relocation address. 
For example: 
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lw $r6, 0x10008000 



lui $at,0xl0001 
lw $r6,0x8000($at) 



at = 100010000 
+ FFFF8000 

10008000 



( 



Figure 9-5. 

Example 4: Offset Reference— R__GPREL. This example shows an as- 
sembly macro that loads a global pointer relative value y into Register 6: 



lw 



$r6,y 



In processing this statement, the assembler generates a as the value y 
and the following machine language statement: 

h: lw $r6,0($gp) #R_GPREL relocation type at address h for symbol y 

and a R_GPREL relocation entry would be produced. The assembler then 
uses the difference between the address for the symbol y and the address 
of the global pointer, as the immediate value for the instruction. The link 
editor gets the value of the global pointer used by the assembler from 
gp_yalue in the Optional Header (Table 9-4). 



( 



9.7 Object Files 



This section describes the three object-file formats created by the link edi- 
tor, namely the Impure (OMAGIC), Shared Text (NMAGIC), and De- 
mand Paged (ZMAGIC) formats. Before reading this section, you should 
be familiar in the format and contents of the text, data, and bss segments 
as described in the Section Data section of this chapter. 



( 
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9.7.1 Impure Format (OMAGIC) Files 

An OMAGIC file has the format shown in Figure 9-6. 













.bss 


jr bss segment 

j| data segment 
""§-. text segment 


alig 




.sbss 




.sdata 




.data 




.rdata 




.text 


ned c 


jn 8-byte bo 


undary 



Figure 9-6. Layout of OMAGIC Files in Virtual Memory 

The OMAGIC format has the following characteristics: 

• Each section follows the other in virtual address space aligned on an 
8-byte boundary. 

• No blocking of sections. 

• Text, data and bss segments can be placed anywhere in the virtual 
address space using the link editor's — T, — D and — B options. 

• The addresses specified for the segments must be rounded to 8-byte 
boundaries. 

• The text segment contains only the .text section. 

• The sections in the data segment are ordered as follows: .rdata, .data 
and .sdata 

• The sections in the bss segment are ordered as follows: .sbss and .bss. 
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9.7.2 Shared Text (NMAGIC) Files 

An NMAGIC file has the format shown in Figure 9-7. 



Pagesize boundary 






.bss 








.sbss 




%* 


.sdata 






J^ 1 


.data 




*fc^ 


.rdata 






.text JS"" 


J^ 







bss segment 

data segment 
text segment 



aligned on 8-byte boundary 



Figure 9-7. Layout of NMAGIC Files in Virtual Memory 



An NMAGIC file has the following characteristics: 

• The virtual address of the data segment is rounded up to the next 
pagesize boundary. 

• No blocking of sections. 

• Each section follows the other in virtual address space aligned on an 
8-byte boundary. 

• Only the start of the text section, using the link editor's — T option, 
can be specified for a shared text format file; the start of the text 
section must be a multiple of the pagesize. 



( 



( 
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9.7,3 Demand Paged (ZMAGIC) Files 

A ZMAGIC file is a demand, paged file in the format shown in Figure 
9-8. 

A ZMAGIC file has the following characteristics: 

• The text segment and the data segment are blocked, with pagesize as 
the blocking factor. Blocking reduces the complexity of paging in the 
files. 



2G 
2G-32K 



fill area 



256M 



32K + header 
32K 





miSSSSm 



I Stack Area 9 



sbrk arena 



.bss 



.sbss 
; G areet-fSil 



: i : sdata :: ; ; 



.data 



•rdata : 



empty 



'i^MWW&£;- : 



Ktext?-- 



: 'tteade|| 



empty 



32K (not accessible 
by user) . 



bottom of stack 
increases automatically 
as required. 



bss segment 



►data segment 
(blocked by pagesize) 



"text segment 
(blocked by pagesize) 



Figure 9-8. Layout of ZMAGIC Files in Virtual Memory 
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• The size of the sum of the of the File, Section, and Optional Headers 
(Tables 9-1, 9-4, and 9-6) rounded to 8 bytes is included in 
blocking of the text segment. 

• The .text section must start at 0x8000 (32K) or higher, plus the size 
of the sum of the headers again rounded to 8 bytes. With the 
standard software, the .text section starts at 0x400000 + header size. 



( 



NOTE: This is required because the first 32K bytes of memory are 
reserved for future use by the compiler system to allow data access 
relative to the constant register 0. 

• Only the start of the text section, using the link editor's — T option 
can be specified for a demand paged format file and must be a 
multiple of the pagesize. 

Figure 9-9 shows a ZMAGIC file as it appears in a disk file. 



Symbol Table 



Fill Area 



.sdata 



.data 



.rdata 



fill area 



.text 



headers 



data segment 
(blocked by pagesize) 



text segment 
(blocked by pagesize) 



( 



Figure 9-9. Layout of a ZMAGIC File on Disk 
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9.7.4 Loading Object Files 

The link editor produces object files with their sections in a fixed order 
similar to UNIX system object files that existed before COFF. See Figure 
9-2 for the a description of the sections and how they are formatted. 

The sections are grouped into segments, which are described in the Op- 
tional Header. In loading the object module at run-time, the system (ker- 
nel) loader needs only the magic number in the File Header and the Op- 
tional Header to load an object file for execution. 

The starting addresses and sizes of the segments for all types of object files 
are specified similarly, and they are loaded in the same manner. 

After reading in the File Header and the Optional Header, the system 
(kernel) loader must examine the file magic number to determine if the 
program can be loaded. Then, the system (kernel) loader loads the text 
and data segments. 

The starting offset in the file for the text segment is given by the macro 

N_TXTOFF(f ,a) 

in the header file a.out.h, where f is the File Header structure and a is 
the option header structure for the object file to be loaded. The tsize 
field in the Optional Header (Table 9-4) contains the size of the text seg- 
ment and text_start contains the address at which it is to be loaded. 

The starting offset of the data segment follows the text segment. The 
djsize field in the Section Header (Table 9-6) contains the size of the 
data segment; datajstart contains the address at which it is to be loaded. 

The system (kernel) loader must fill the bss segment with zeros. The 
bssjstart field in the Optional Header specifies the starting address; bsize 
specifies the number of bytes to be filled with zeros. In ZMAGIC files, 
the link editor adjusts bsize to account for the zero filled area it created in 
the data segment that is part of of the .sbss or .bss sections. 

If the object file itself does not load the global pointer register it must be 
set to the gp_yalue field in the Optional Header (Table 9-4). 

The other fields in the Optional Header are gprmask and cprmask[4], 
whose bits show the registers used in the .text section. They can be used 
by the operating system, if desired, to avoid save register relocations on 
context-switch. 
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9.8 Archive Files 



The link editor can link object files in archives created by the archiver. 
The archiver and the format of the archives are based on the UNIX Sys- 
tem V portable archive format. To improve performance, the format of 
the archives symbol table was changed so that it is a hash table, not a lin- 
ear list. 

The archive hash table is accessed through the ranhashinit and ran- 
lookup () library routines in libmld.a, which are documented in the manual 
page ranhash (3x). The archive format definition is in the header file 
ar.h. 



9.9 Link Editor Defined Symbols 

Certain symbols are reserved and their values are defined by the link edi- 
tor. A user program can reference these symbols, but not define one, or 
else an error is generated. Table 9-12 lists the names and values of these 
symbols; the header file sym.h contains their preprocessor macro defini- 
tions. I 

Symbol Value Description 

first location after the .text section 

first location after the .sdata section 

(all initialized data) 

first location after the .bss section (all data) 

1 first location of the .text section 

"•first location of the .data section 

1 first location of the .bss section 

1 the value of the global pointer 



ETEXT 


"etext" 


EDATA 


"edata" 


END 


"end" 


FTEXT 


" ftext" 


FDATA 


" fdata' 


FBSS 


" fbss" 


GP 


"_8P" 



1 compiler system only 



Table 9-12. Link Editor Defined Symbols 



( 
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The first three symbols come from the standard UNIX system link editors 
and the rest are compiler system specific. The last symbol is used by the 
start up routine to set the value of the global pointer, as shown in the fol- 
lowing assembly language statements: 

globl _GP 

la $gp,_GP 

The assembler generates the following machine instructions for these state- 
ments: 

a: lui gp,0 # R_REFHI relocation type at address a for symbol _GP 
b: add gp,0 # R_REFLO relocation type at address b for symbol _GP 



which would cause the correct value of the global pointer to be loaded. 
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10. The Symbol Table 



This chapter describes the symbol table and symbol table routines used to 
create and make entries in the table. The chapter contains the following 
major sections: 

• Overview, which gives the purpose of the Symbol table, a summary of 
its components, and their relationship to each other. 

• Format of Symbol Table Entries, which shows the structures of Symbol 
table entries and the values you assign them through the Symbol Table 
routines. 

• Symbol Table Routine Reference, which lists the symbol table routines 
supplied with the compiler and summarizes the function of each. 

NOTE: Third Eye Software, Inc. owns the copyright (dated 1984) to the 
format and nomenclature of the Symbol Table used by the compiler sys- 
tem as documented in this chapter. 

Third Eye Software, Inc. grants reproduction and use rights to all parties, 
PROVIDED that this comment is maintained in the copy. 

Third Eye makes no claims about the applicability of this symbol table to 
a particular use. 



10.1 Overview 



The symbol table in created by the compiler front-end as a stand-alone 
file. The purpose of the table is to provide information to the link editor 
and the debugger in performing their respective functions. At the option 
of the user, the link editor includes information from the Symbol table in 
the final object file for use by the debugger. See Figure 9-1 in Chapter 9 
for details. 
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Created only if debugging 
is ON. 



Symbolic Header 



Line Numbers 



Dense Numbers; 



Procedure Descriptor Table 



Local Symbols 



Optimization Symbols 



Auxiliary Symbols 



Local Strings 



|i^im|||Siir>||| 



File Descriptor 



Relative File Descriptor 



External Symbols 



Tj 1 table per compilation. fj 1 table per source file and 

per include file. 



c 



( 



Figure 10-1. The Symbol Table - Overview 



The elements that make up the Symbol table are shown in Figure 10-1. 
The front-end creates one group of tables (the shaded areas in Figure 
10.1) that contain global information relative to the entire compilation. It 
also creates a unique group of tables (the unshaded areas in the figure) 
for the source file and each of its include files. 



( 
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Compiler front-ends, the assembler, and the link editor interact with the 
symbol table as summarized below: 

• The front-end, using calls to routines supplied with the compiler 
system, enters symbols and their descriptions in the table. 

• The assembler fills in line numbers, optimization symbols, updates 
Local Symbols and External Symbols, and updates the Procedure 
Descriptor table. 

• The link editor eliminates duplicate information in the External 
Symbols and the External Strings tables, removes tables with duplicate 
information, updates Local Symbols with relocation information, and 
creates the Relative File Descriptor table. 

The major elements of the table are summarized in the paragraphs that 
follow. Some of these elements are explored in more detail later in the 
chapter. 

Symbolic Header. The Symbolic Header (HDRR for HeadDeR Record) 
contains the sizes and locations (as an offset from the beginning of the 
file) of the subtables that make up the Symbol Table. Figure 10-2 shows 
the symbolic relationship of the header to the other tables. 

















Symbolic Header 














Line Numbers 








Dense Numbers 




Procedure 
Descriptor Table 




Local Symbols 




Optimization Symbols 


Auxliary Symbols 




Local Strings 




External Strings 






File Descriptor Table 













Figure 10-2. Functional Overview of the Symbolic Header 
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Line Numbers. The assembler creates the Line Number table. It creates 
an entry for every instruction. Internally, the information is stored in an 
encoded form. The debugger uses the entries to map instruction to the 
source lines and vice versa. 

Dense Numbers. The Dense Number table is an array of pairs. An in- f 
dex into this table is called a dense number. Each pair consists of a file 1 
table index (ifd) and an index (isym) into Local Symbols. The table fa- 
cilitates symbol look-up for the assembler, optimizer, and code generator 
by allowing direct table access rather than hashing. 

Procedure Descriptor Table. The Procedure Descriptor table contains 
register and frame information, and offsets into other tables that provide 
detailed information on the procedure. The front-end creates the table 
and links it to the Local Symbols table. The assembler enters information 
on registers and frames. The debugger uses the entries in determining the 
line numbers for procedures and frame information for stack traces. 

Local Symbols. The Local Symbols table contains descriptions of pro- 
gram variables, types, and structures, which the debugger uses to locate 
and interpret runtime values. The table gives the symbol type, storage 
class, and offsets into other tables that further define the symbol. 

A unique Local Symbols table exists for every source and include file; the g 
compiler locates the table through an offset from the file descriptor entry ■ 
that exists for every file. The entries in Local Symbols can reference re- ^ 
lated information in the Local Strings and Auxiliary Symbols subtables. 
This relationship is shown in Figure 10-3. 



( 
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| File Descriptor Table 1 






Entry for File 








1 






Entry for File 1 






Local Symbols 
















Entry for File 2 








Local Strings I 
Auxiliaries 1 


Entry for File n 


















Local Strings I 
Auxiliaries I 






i:.:!:"::.:::"::.:"'::!.":"::.™!".::!"::.:::"!:.!!!' 














Local Strings I 
Auxiliaries 1 

















Figure 10-3. Logical Relationship between the File Descriptor Table and 

Local Symbols 



Optimization Symbols. To be defined at a future date. 

Auxiliary Symbols. The Auxiliary Symbols tables contain data type infor- 
mation specific to one language. Each entry is linked to an entry in Local 
Symbols. The entry in Local Symbols can have multiple, contiguous en- 
tries. The format of an auxiliary entry depends on the symbol type and 
storage class. Table entries are required only when the compiler debug- 
ging option is ON. 

Local Strings. The Local Strings subtables contain the names of local 
symbols. 

External Strings. The External Strings table contains the names of exter- 
nal symbols. 
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File Descriptor. The File Descriptor table contains one entry each for 
each source file and each of its include files. (The structure of an entry is 
given in Table 10-12 later in this chapter.) The entry is composed of 
pointers to a group of subtables related to the file. The physical layout of 
the subtables is shown in Figure 10-4. 



File Descriptor Table 



File Descriptor Entry 



Line Numbers 



Procedure 
DRsnriptnr Table 



Local Symbols 



Optimization Symbols 



Auxliary Symbols 



Local Strings 



Relative File Descriptor 



( 



Figure 10-4. Physical Relationship of a File Descriptor Entry to Other 

Tables 



The file descriptor entry allows the compiler to access a group of subtables 
unique to one file. The logical relationship between entries in this table 
and in its subtables is shown in 10-5. 



( 
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File Descriptor Table 



Entry for File 



Entry for File 1 



Entry for File 2 





Line Numbers 




Procedure 
Descriptor Table 




Local Symbols 


Mm 


Optimization Symbols 


P"*' 


Auxliary Symbols 




Local Strings 




Relative File Descriptor 



Line Numbers 



Procedure 
Descriptor Table 



Local Symbols 



Optimization Symbols 



Auxliary Symbols 



Local Strings 



Relative File Descriptor 



Figure 10-5. Logical Relationship Between the File Descriptor Table 

and Other Tables 



Relative File Descriptor. See the section Link Editor Processing later in 
this chapter. 
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External Symbols. The External Symbols contains global symbols entered 
by the front-end. The symbols are defined in one module and referenced 
in one or more other modules. The assembler updates the entries, and 
the link editor merges the symbols and resolves their addresses. 



10.2 Format of Symbol Table Entries 



10.2.1 Symbolic Header 

The structure of the Symbolic Header is shown below in Table 10-1; the 
sym.h header file contains the header declaration. 



c 



c 
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Declaration 



Name 



Description 



short 


magic 


short 


vstamp 


long 


ilineMax 


long 


cbLine 


long 


cbLineOffset 


long 


idnMax 


long 


cbDnOffset 


long 


ipdMax 


long 


cbPdOffset 


long 


isymMax 


long 


cbSymOffset 


long 


ioptMax 


long 


cbOptOffset 


long 


iauxMax 


long 


cbAuxOffset 


long 


issMax 


long 


cbSsOffset 


long 


issExtMax 


long 


cbSsExtOffset 


long 


ifdMax 


long 


cbFdOffset 


long 


crfd 


long 


cbRfdOffset 


long 


iextMax 


long 


cbExtOffset 



to verify validity of the table 

version stamp 

number of line number entries 

number of bytes for line number entries 

index to start of line numbers 

max index into dense numbers 

index to start dense numbers 

number of procedures 

index to procedure descriptors 

number of local symbols 

index to start of local symbols 

maximum index into optimization entries 

index to start of optimization entries 

number of auxiliary symbols 

index to the start of auxiliary symbols 

max index into local strings 

index to start of local strings 

max index into external strings 

index to the start of external strings 

number of file descriptors 

index to file descriptor 

number of relative file descriptors 

index to relative file descriptors 

maximum index into external symbols 

index to the start of external symbols 



Table 10-1. Format of the Symbolic Header 

The lower byte of the vstamp field contains LSJSTAMP and the upper 
byte MSJSTAMP (see the stamp. h header file). These values are defined 
in the stamp. h file. The iMax fields and the cbOffset field must be set to 
if one of the tables shown in Table 10-1 isn't present. The magic field 
must contain the constant magic Sym, also defined in longsymconst.h. 
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10.2.2 Line Numbers 



Table 10-2 shows the format of an entry in the Line Numbers table; the 
sym.h header file contains its declaration. 



Declaration 



Name 



( 



typedef long 



LINER, *PLINER 



Table 10-2. Format of a Line Number Entry 



The line number section in the Symbol table is rounded to the nearest 
four-byte boundary. 

Line numbers map executable instructions to source lines; one line num- 
ber is stored for each instruction associated with a source line. It is stored 
as a long integer in memory and in packed format on disk. 

The layout on disk is as follows: 



Bit 


8 


4 



























Delta 


Count 





( 



Figure 10-6. Disk Layout of Line Numbers 



The compiler assigns a line number to only those lines of source code that 
generate one or more executable instructions. 

Delta is a four-bit value in the range — -7...?, defining the number of 
source lines between the current source line, and the previous line gener- 
ating executable instructions. The Delta of the first line number entry is 
the displacement from the InLow field in the Procedure Descriptor Table. 

Count is a four-bit field with a value in the range 0...15 indicating the 
number (1...16) of executable instructions associated with a source line. 



( 
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If more than 16 instructions (15+1) are associated with a source line, new 
line number entries are generated with Delta = 0. 

An extended format of the line number entry is used when Delta is out- 
side the range of —7... 7. 

The layout of the extended field on disk is as follows: 



Bit 



Bit 



1 



Constant —8 



Count 



Bit 8 



Upper eight bits of Delta 
8 4 



Lower eight bits of Delta 



Figure 10-7. Disk Layout of Extended Field 



NOTE: The compiler allows a maximum of 32,767 comment lines, blank 
lines, continuation lines and other lines not producing executable instruc- 
tions, between two source lines that do. 

Line number example. This section gives an example of how the com- 
piler assigns line numbers. For the source listing shown in Figure 10-8, 
the compiler generates line numbers only for the highlighted lines (6, 7, 
17, 18, and 19); the other lines are either blank or contain comments. 
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Illlllll 


^include 


<stdio.h> 


z 


main {) 




Illlllll 


{ 






illllllll 




lliisllil 


illllllllllllll^ 


lllllll-l 








6 




printf ("this program just prints its input\n") ; 


7 




while 


((c = getc(stdin)) != EOF) { 


.-• K!' 






/* this is a greater than an seven line comment 


||;|||:|; 






• * 1 


:•"• V>V 






lllll^ 


11 






Illlllll 


.'.':' 4'?.". 






;i|||l:|||||||lll|||||||| 


Illllll 






||§|a|||||||||||||||||||| 


lllllll 






liiiiillli-illllll^^ 


Illllll 






8:!$!$^^ 


iiiiii 






lllllii^^ 


17 






printf ("%c", c) ; 


18 




} /* 


end while */ 


19 


} 


/* end 


main */ 



Figure 10-8. Source Listing for Line Number Example 



( 



Figure 10-9 (on the next page) shows the instructions generated for lines 
3, 7, 17, 18, and 19. Figure 10-10) shows the compiler-generated liner 
entries for each source line. 



( 
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Source 
Line 


Liner 


Contents 


Meaning 


3 

6 

7 

7 

17 1 
18 
1* 
19 


02 

31 

1f 

03 

82 00 0a 

1f 

03 

15 


delta 0, count 2 
delta 3, count 1 
delta 1 , count 15 
delta 0, count 3 
-8 , count 2, delta 10 
delta 1, count 15 
delta 2 , count 3 
delta 1 , count 5 


1 Extended format (count is greater than 
seven lines). 

Continuation. 



Figure 10-9. Source Listing for Line Number Example 
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[main: 3, 0x4001a0] 


addiu 


Sp,Sp,-32 y^^s. 


[main: 3, 0x4001a4] 


sw 


rsi,20(sp) m0m 


[main: 3, Ox4001a8] 


sw 


rl6,l6(sp) p3i§:.6:| 


[main: 6, 0x4001ac] 


jal 


printf ^^W 


[main: 6 L 0x4001b0] 


addiu 


r4,gp, -32752 


[main: 7, 0x4001b4] 


lw 


rl4,-32552(gp) 


[main: 7, 0x4001b8] 


nop 




[main: 7, Ox4001bc] 


addiu 


rl5,rl4,-l 


[main: 7, 0x4001c0] 


bltz 


rl5,0x4001e4 


[main: 7, 0x4Q01c4] 


sw 


rl5, -32552 (gp) 


[main: 7, 0x4001c8] 


lw 


r24, -32548 (gp) 


[main: 7, 0x4001cc] 


nop 




[main: 7, 0x4001d0] 


lbu 


r25,0(r24) /fl*\ 


[main: 7, 0x4001d4] 


addiu 


r8,r24,l llllll 


[main: 7, Ox4001d8] 


sb 


r25,31(sp) \^y 


[main: 7, 0x4001dc] 


b 


0x4001f4 


[main: 7, 0x4001e0] 


sw 


r8, -32548 (gp) 


[main: 7, 0x400le4] 


jal 


_filbuf 


[main: 7, Ox4001e8] 


addiu 


r4,gp, -32552 


[main:?, 0x4001ec] 


move 


rl6,r2 


[main: 7, 0x4001f0] 


sb 


rl6,31(sp) 


[main: 7, 0x4001f4] 


lbu 


r9,31(sp) 


[main: 7, Ox4001f8] 


li 


rl,-l 


[main: 7, 0x4001fc] 


beq 


r9,rl, 0x400260 


[main* 7, 0x4009.00] 


nop 




[main: 17, 0x400204] 


lbu 


r5,31(sp) /$$8\ 


[main: 17, 0x400208] 


jal 


printf f-PPj 
r4,gp, -32716 W 


Imain:17, 0x40020c] 


addiu 


[main: 18, 0x400210] 


lw 


rl0,-32552(gp) 


[main: 18, 0x400214] 


nop 




[main: 18, 0x400218] 


addiu 


rll,rlO,-l 


[main: 18, 0x40021c] 


bltz 


rll, 0x400240 


[main: 18, 0x400220] 


sw 


rll,-32552(gp) 


[main: 18, 0x400224] 


lw 


rl2,-32548(gp) 


[main: 18, 0x400228] 


nop 


jf^^^S. 


[main: 18, 0x40022c] 


lbu 


rl3,0(rl2) AA 
rl4,rl2,l \mW§ 


[main: 18, 0x400230] 


addiu 


[main: 18, 0x400234] 


sb 


rl3,31(sp) V^>/ 


[main: 18, 0x400238] 


b 


0x400250 


[main: 18, 0x40023c] 


sw 


rl4, -32548 (gp) 


[main: 18, 0x400240] 


jal 


_filbuf 


[main: 18, 0x400244] 


addiu 


r4,gp, -32552 


[main: 18, 0x400248] 


move 


rl6,r2 


[main: 18, 0x40024c] 


sb 


rl6,31(sp) 


[main: 18, 0x400250] 


lbu 


rl5,31(sp) 


[main: 18, 0x400254] 


li 


rl,-l 


[main: 18, 0x400258] 


bne 


rl5,rl, 0x400204 


[mftin-1R r Ov4009.fin] 


nop 




[main: 19, 0x400260] 


b 


0x400268 


[main: 19, 0x400264] 


nop 


^yyyyy^. 


[main: 19, 0x400268] 


lw 


v*31 on/en^ *<>>>:+:'■■£< 


ro±,*5U(.sp; f t CI : 1 
rl6,16(sp) WmyW 


[main: 19, 0x40026o] 


lw 


[main: 19, 0x400270] 


jr 


r31 ^^ 


[main: 19, 0x400274] 




addiu 


sp, sp,32 



( 



( 



Figure 10-10. Source Listing for Line Number Example 
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10.2.3 Procedure Descriptor Table 

Table 10-3 shows the format of an entry in the Procedure Descriptor ta- 
ble; the sym.h header file contains its declaration. 



Declaration 



Name 



Description 



unsigned, long 


adr 


long 


isym 


long 


iline 


long 


regmask 


long 


regoffset 


long 


iopt 


long 


fregmask 


long 


fregoffset 


long 


frameoffset 


long 


framereg 


long 


pcreg 


long 


InLow 


long 


InHigh 


long 


cbLineOffset 



memory address of start of procedure 

start of local symbols 

procedure's line numbers 

saved register mask 

saved register offset 

procedure's optimization symbol 

entries 

save floating point register mask 

save floating point register offset 

frame size 

frame pointer register 

index or reg of return program 

counter 

lowest line in the procedure 

highest line in the procedure 

byte offset for this procedure from 

the base of the file descriptor 

entry. 



Table 10-3. Format of a Procedure Descriptor Table Entry 
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10.2.4 Local Symbols 

Table 10-4 shows the format of an entry in the Local Symbols table; the 
sym.h header file contains its declaration. 



Declaration 



Name 



Description 



long 


iss 


long 


value 


unsigned 


st : 6 


unsigned 


sc : 5 


unsigned 


reserved : 1 


unsigned 


index : 20 



index into local strings of symbol name 
value of symbol. See Table 10-5. 
symbol type. See Table 10-6. 
storage class. See Table 10-7. 

index into local or auxiliary symbols 
See Table 3-5. 



Table 10-4. Format of a Local Symbols Entry 



The meanings of the fields in a local symbol entry are explained in the 
following paragraphs. 

iss. The iss (for index into string space) is an offset from the issBase 
field of an entry in the file descriptor table, to the name of the symbol. 

value. An integer representing an address, size, offset from a frame 
pointer. The value is determined by the symbol type, as illustrated in Ta- 
ble 10-5. 



( 



st and sc. The symbol type (st) defines the symbol; the storage class 
(sc), where applicable explains how to access the symbol type in memory. 
The valid st and sc constants are given in Tables 10-6 and 10-7. These 
constants are defined in symconst.h. 

index. The index is an offset into either Local Symbols or Auxiliary 
Symbols, depending of the storage type (st) as shown in Table 10-5. The 
compiler uses isymBase in the file descriptor entry as the base for a Local 
Symbol entry and iauxBase for an Auxiliary Symbols entry. 



( 



10-16 Assembly Language Programmer's Guide 



IRIS-4D Series 



Symbol Type 


Storage Class 


Index 


Value 


stFile 


scText 


isymMac 


address 


stLabel 


scText 
scD/B 


indexNil 


address 


stGlobal 


iaux 


address 


stStatic 


scD/B 1 


iaux 


address 

i 

frame offset 


stParam 


scAbs 


iaux 




scRegister 


iaux 


register number 
frame offset 




scVar 


iaux 




scVarRegister 


iaux 


register number 


stLocal 


scAbs 


iaux 


frame offset 




scRegister 


iaux 


register number 


stProc 


scText 


iaux 


address 




scNil 


iaux 


address 




scUndefined 


iaux 


address 


stStaticProc 


scText 


iaux 


address 


stMember 








enumeration 


sclnfo 


indexNil 


ordinal ~ 
bit offset 


structure 


sclnfo 


iaux 


union 


sclnfo 


iaux 


bit offset 



^scD/B is the storage class determined by the assembler, either 
large/small or data/bss. 

2 
frame offset is the offset from the virtual frame pointer. 

3 
bit offset is computed from the beginning of the procedure. 



Table 10-5 (Part 1 of 2). Index and Value as a Function of Symbol 
Type and Storage Class 
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Symbol Type 


Storage Class 


Index 


Value 


stBlock 




1 
isymMac 




enumeration 


sclnfo 


max enumeration 


structure 


sclnfo 


isymMac 


size 


text bock 


scText 


isymMac 


relative addres? 


common block 


scCommon 


isymMac 


size 


variant 


scVariant 


isymMac 


isymTag 3 
iauxRanges 


variant arm 


sclnfo 


isymMac 


union 


sclnfo 


isymMac 


size 


stEnd 








enumeration 


sclnfo 


isymStart 


2 
relative address 


file 


scText 


isymStart 


procedure 


scText 


isymStart 


relative addresSr 


structure 


sclnfo 


isymStart 


o 


text block 


scText 


isymStart 


relative address 


union 


sclnfo 


isymStart 





common block 


scCommon 


isymStart 





variant 


scVariant 


isymStart 





variant arm 


sclnfo 


isymStart 





stTypedef 


sclnfo 


iaux 






1 
isymMac is the isym of the corresponding stEnd symbol plus 1 . 

^relative address is the relative displacement from the beginning of the 
procedure. 

3 
isymTab is the isym to the symbol that is the tag for the variant. 

4 
iauxRanges is the iaux to ranges for the variant arm. 

^isymStart is the isym of the correspodning begin block (stBlock, stFile, 
stProc, etc.) 



( 



Table 10-5 (Part 2 of 2). Index and Value as a Function of Symbol 
Type and Storage Class 



The link editor ignores all symbols except the types stProc, stStatic, 

stLabel, stStaticProc, which it will relocate. Other symbols are used only ^ 

by the debugger, and need be entered in the table only when the compiler I 

debugger option is ON. ^ 



10-18 Assembly Language Programmer's Guide 



IRIS-4D Series 



Symbol Type (st). Table 10-6 gives the allowable constants that can be 
specified in the st field of Local Symbols entries; the symconst.h header 
file contains the declaration for the constants. 



Constant 


Value 


Description 


stNil 





Dummy entry 


stGlobal 


1 


external symbol 


stStatic 


2 


static 


stParam 


3 


procedure argument 


stLocal 


4 


local variable 


stLabel 


5 


label 


stProc 


6 


Procedure 


stBlock 


7 


start of block 


stEnd 


8 


end block, file, or procedures 


stMember 


9 


member of structure, union, or 
enumeration. 


stTypedef 


10 


type definition 


stFile 


11 


file name 


stStaticProc 


14 


load time only static procs 


stConstant 


15 


const 



Table 10-6. Symbol Type (st) Constants Supported by the Compiler 
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Storage Class (st) Constants. Table 10-7 gives the allowable constants 
that can be specified in the sc field of Local Symbols entries; the sym- 
const.h header file contains the declaration for the constants. 



Constant 



Value Description 



scNil 





scText 


1 


scData 


2 


scBss 


3 


scRegister 


4 


scAbs 


5 


scUndefined 


6 


reserved 


7 


scBits 


8 


scDbx 


9 


scReglmage 


10 


sclnfo 


11 


scUserStruct 


12 


scSData 


13 


scSBss 


14 


scRData 


15 


scVar 


16 


scCommon 


17 


scSCommon 


18 


scVarRegister 


19 


scVariant 


20 


scUndefined 


21 



dummy entry. 

text symbol 

initialized data symbol 

un-initialized data symbol 

value of symbol is register number 

symbol value is absolute; not to be 

relocated. 

Used but undefined in the current module. 

this is a bit field 

dbx internal use 

register value saved on stack 

symbol contains debugger information 

address in struct user for current process 

(load time only) small data 

(load time only) small common 

(load time only) read only data 

Var parameter (Fortran or Pascal) 

common variable 

small common 

var parameter in a register 

variant records 

small undefined 



( 



Table 10-7. Storage Class Constants Supported by the Compiler 



10.2.5 Optimization Symbols 

Reserved for future use. 



( 
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10.2.6 Auxiliary Symbols 

Table 10-8 shows the format of an entry, which is a union, in Auxiliary 
Symbols; the sym.h file contains its declaration. 



Declaration 


Name 


Description 


TIR 


ti 


type information record 


RNDXR 


rndx 


relative index into local symbols 


long 


dnLow 


low dimension 


long 


dnHigh 


high dimension 


long 


isym 


index into local symbols for stEnd 


long 


iss 


index into local strings (not used) 


long 


width 


width of a structure field not 
declared with the default value 
for size. 


long 


count 


count of ranges for variant arm 



Table 10-8. Storage Class Constants Supported by the Compiler 



All of the fields except the ti field are explained in the order they appear 
in the above layout. The ti field is explained last. 

rndx. Relative File Index. The front-end fills this field in describing 
structures, enumerations, and other complex types. The relative file index 
is a pair of indexes. One index is an offset from the start of the File De- 
scriptor table to one of its entries. The second is an offset from the file 
descriptor entry to an entry in the Local Symbols or Auxiliary Symbols 
table. 

dnLow. Low Dimension of Array. 

dnHigh. High Dimension of Array. 

isym. Index into Local Symbols. This index is always an offset to an 
stEnd entry denoting the end of a procedure. 
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width. Width of Structured Fields. 



count. Range Count. Used in describing case variants. Gives how many 
elements are separated by commas in a case variant. 

ti. Type Information Record. Table 10-9 shows the format of a ti en- 
try; the sym.h file contains its declaration. 



( 



Declaration 


Name 


Description 


unsigned 


fBitfield : 1 


setif bit width is specified 


unsigned 


continued : 1 


next auxiliary entry has 
tq info 


unsigned 


bt : 6 


basic type 


unsigned 


tq4 


4 


Type qualifier. 


unsigned 


tq5 


4 




unsigned 


tqO 


4 




unsigned 


tql 


4 




unsigned 


tq2 


4 




unsigned 


tq3 


4 





Table 10-9. Format of a ti Type Information Record Entry 



( 



All groups of auxiliary entries have a type information record with the fol- 
lowing entries: 

m f bitfield. Set if the basic type (bt) is of no n-strandard width. 

• bt (for basic type) specifies if the symbol is integer, real complex, 
numbers , a structure, etc. The valid entries for this field are shown in 
Table 10-10; the sym.h file contains its declaration. 

• tq (for type qualifier) defines whether the basic type (bt) has an 
array of, function returning, or pointer to qualifier. The valid entries 
for this field are shown in Table 10-11; the sym.h file contains its 
declaration. 



( 
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Default 




Constant 


Value 


Size* 


Description 


btNil 








undefined, void 


btAdr 


1 


32 


address - same size as 


pointer 








btChar 


2 


8 


symbol character 


btUChar 


3 


8 


unsigned character 


btShort 


4 


16 


short (16 bits) 


btUShort 


5 


16 


unsigned short 


btlnt 


6 


32 


integer 


btUInt 


7 


32 


unsigned integer 


btLong 


8 


32 


long (32 bits) 


btULong 


9 


32 


unsigned long 


btFloat 


10 


32 


floating point (real) 


btDouble 


11 


64 


double-precision floating 


point real 








btStruct 


12 


n/a 


structure (Record) 


btUnion 


13 


n/a 


union (variant) 


btEnum 


14 


32 


enumerated 


btTypedef 


15 


n/a 


defined via a typedef; 
rndx points at a 
stTypedef symbol. 


btRange 


16 


32 


subrange of integer 


btSet 


17 


32 


pascal sets 


btComplex 


18 


64 


fortran complex 


btDComplex 


19 


128 


fortran double complex 


btlndirect 


20 




Indirect definition; rndx 
points to TIR aux 


btMax 


64 






*Size in bits. 









Table 10-10. Basic Type (bt) Constants 
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Constant 


Value 


Description 


tqNil 





Place holder. No qualifier. 


tqPtr 


1 


pointer to 


tqProc 


2 


function returning 


tqArray 


3 


array of 


tqVol 


5 


volatile 


tqMax 


8 





( 



Table- 10-11. Type Qualifier (tq) Constants 



10.2.7 File Descriptor Table 

Table 10-12 shows the format of an entry in the File Descriptor table; 
the sym.h file contains its declaration. 



( 



c 
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Declaration 



Name 



Description 



unsigned, long 


adr 


long 


rss 


long 


issBase 


long 


cbSs 


long 


isymBase 


long 


csym 


long 


ilineBase 


long 


cline 


long 


ioptBase 


long 


copt 


short 


ipdFirst 


short 


cpd 


long 


iauxBase 


long 


caux 


long 


rf dBase 


long 


crfd 


unsigned 


lang : 5 


unsigned 


fMerge : 1 


unsigned 


fReadin : 1 


unsigned 


fBigendian : 1 


unsigned 


reserved : 22 


long 


cbLineOffset 


long 


cbLine 



memory address of start of file 

source file name 

start of local strings 

number of bytes in local strings 

start of local symbol entries 

count of local symbol entries 

start of line number entries 

count of line number entries 

start of optimization symbol entries 

count of optimization symbol entries 

start of procedure descriptor table 

count of procedures descriptors 

start of auxiliary symbol entries 

count of auxiliary symbol entries 

index into relative file descriptors 

relative file descriptor count 

language for this file 

whether this file can be merged 

true if it was read in (not 

just created) 

if set, was compiled on big endian 

machine aux's is in compile 

host's sex 

reserved for future use 

byte offset from header or file ln's 



Table 10-12. Format of File Descriptor Entry 
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10.2.8 External Symbols 

The External Symbols table has the same format as Local Symbols, except 
an offset (ifd) field into the File Descriptor table has been added. This 
field is used to locate information associated with the symbol in an Auxil- 
iary Symbols table. Table 10-13 shows the format of an entry in External 
Symbols; the sym.h file contains its declaration. 



Declaration 



Name 



Description 



short 


reserved 


short 


ifd 


SYMR 


asym 



reserved for future use 
pointer to file descriptor entry 
Same as Local Symbols 



Table 10-13. Format of an Entry in External Symbols 



( 



( 
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Appendix A: Instruction Summaries 



In the tables in this appendix, the operand terms have the following mean- 
ings: 



Operand 


Description 


destination 


destination register 


address 


expression 


source 


source register 


expression 


aboslute value 


immediate 


immediate value 


label 


symbol label 


breakcode 


value that determines the break 



Table A-1. Operand Meanings 
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Description 


Op-coda 


Operand 


Load Address 


la 


destination, address 


Load Byte 


lb 




Load Byte Unsigned 


lbu 




Load Halfword 


lh 




Load Halfword Unsigned 


lhu 




Load Word 


lw 




Load Coprocessor z 


lwcz 




Load Word Left 


lwl 




Load Word Right 


lwr 




Store Byte 


sb 


source, address 


Store Halfword 


sh 




Store Word 


sw 




Store Word Coprocessor z 


swcz 




Store Word Left 


swl 




Store Word Right 


swr 




Unaligned Load Halfword 


ulh 




Unaligned Load Halfword Unsigned 


ulhu 




Unaligned Load Word 


ulw 




Unaligned Store Halfword 


ush 




Unaligned Store Word 


usw 





( 



Table A-2. Main Processor Instruction Summary 



( 
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Description 


Op-code 


Operand 


Load Immediate 


li 


destination, expression 


Load Upper Immediate 


lui 




Restore From Exception 


rfe 




Syscall 


syscall 




Absolute Value 


abs 


destination, srcl 


Negate (with overflow) 


neg 


destination/srcl 


Negate (without overflow) 


negu 




NOT 


not 




Add (with overflow) 


add 


destination, srcl , src2 


Add (without overflow) 


addu 


destination/srcl , src2 


AND 


and 


destination, srcl , immediate 


Divide (with overflow) 


div 


destination/srcl , immediate 


Divide (without overflow) 


divu 




EXCLUSIVE OR 


xor 




Multiply 


mul 




Multiply (with overflow) 


mulo 




Multiply (with overflow) 


mulou 




Unsigned 






NOT OR 


nor 




OR 


or 




Remainder 


rem 




Remainder Unsigned 


remu 




Rotate Left 


rol 




Rotate Right 


ror 




Set Equal 


seq 




SEt Less Than 


sit 




Set Less Than Unsigned 


situ 




Set Less /Equal 


sle 




Set Less /Equal Unsigned 


sleu 




Set Greater Than 


sgt 




Set Greater Than Unsigned 


sgut 




Set Greater/Equal 


sge 




Set Greater/Equal Unsigned 


sgeu 




Set Not Equal 


sne 




Shift Left Logical 


sll 




Shift Right Arithmetic 


sra 




Shift Right Logical 


srl 




Subtract (with overflow) 


sub 




Subtract (without overflow) 


subu 




Multiply 


mult 


srcl,src2 


Multiply Unsigned 


multu 





Tabie A-2. Main Processor Instruction Summary (continued) 
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Description 


Op-code 


Operand 


Branch 


b 


label 


Branch Coprocessor z True 


bczt 




Branch Coprocessor z False 


bczf 




Branch on Equal 


beq 


srcl,src2, label 


Branch on Greater 


bgt 


srcl , immediate , label 


Branch on Greater/Equal 


bge 




Branch on Greater/Equal Unsigned 


bgeu 




Branch on Greater Than Unsigned 


bgtu 




Branch on Less 


bit 




Branch on Less/Equal 


ble 




Branch on Less/Equal Unsigned 


bleu 




Branch on Less Than Unsigned 


bltu 




Branch on Not Equal 


bne 




Branch and Link 


bal 


label 


Branch on Less Than Zero and Link 


bltzal 




Branch on Greater or Equal to zero 


bgezal 




and Link 






Branch on Equal Zero 


beqz 


srcl, label 


Branch on Greater/Equal Zero 


bgez 




Branch on Greater Than Zero 


bgtz 




Branch on Less/Equal Zero 


blez 




Branch on Less Than Zero 


bltz 




Branch on Not Equal Zero 


bnqz 




Jump 


J 


address 


Jump and Link 


jal 


srcl 


Break 


break 


breakcode 


Coprocessor z Operation 


cz 


expression 


Move 


move 


destination, srcl 


Move From HI Register 


mfhi 


register 


Move To HI Register 


mthi 




Move From LO Register 


mflo 




Move To LO Register 


mtlo 




Move From Coprocessor z 


mfcz 


dest-gpr, source 


Move To Coprocessor z 


mtcz 


src-gpr, destination 


Control From Coprocessor z 


cfcz 


src-gpr, destination 


Control to Coprocessor z 


ctcz 


dest-gpr, source 



( 



( 



Table A-2. Main Processor Instruction Summary (continued) 
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Description 



Op-code Operand 



Translation Lookaside Buffer Probe 
Translation Lookaside Buffer Read 
Translation Lookaside Buffer Write Random 
Translation Lookaside Write Index 



tlbp 
tlbr 
tlbwr 
tlbwi 



Table A-3. System Coprocessor Instruction Summary 



Description 



Op-code Operand 



Load Fp 
Double 
Single 



l.d 
l.s 



destination, offset (base) 



Store FP 
Double 
Single 



s.d 

s.s 



source , offset(base) 



Absolute Value Fp 
Double 
Single 



abs.d 
abs.s 



destination, srcl 



AddFp 

Double 

Single 
Divide Fp 

Double 

Single 
Multiply 

Double 

Single 
Subtract Fp 

Double 

Single 



add.d 
add.s 

div.d 
div.s 

mul.d 
mul.s 

sub.d 
sub.s 



destination, srcl ,src2 



Table A-4. Floating Point Instruction Summary 
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Description 


Op-code 


Operand 


Convert Source to 






Specified Precision Fp 






Double to Single 


cvt.s.d 


destination, src2 


Fixed Point to Single 


cvt.s.w 




Fixed Point to Double 


cvt.d.w 




Single to Double 


cvt.d.s 




Double to Fixed Point 


cvt.w.d 




Single to Fixed Point 


cvt.w.s 




Negate Floating Point 






Double 


neg.d 




Single 


neg.s 





Table A-4. Floating Point Instruction Summary (continued) 



( 



c 
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Description Op-code Operand 

Compare Fp 

F Single c.f.s srcl,src2 

F Double c.f.d 

UN Single c.un.s 

UN Double c.un.d 

*EQ Single c.eq.s 

*EQ Double c.eq.d 

UEQ Single c.ueq.s 

UEQ Double c.ueq.d 

OLT Single c.olt.s 

OLT Double c.olt.d 

ULT Single c.ult.s 

ULT Double c.ult.d 

OLE Single c.ole.s 

OLE Double c.ole.d 

ULE Single c.ule.s 

ULE Double c.ule.d 

SF Single c.sf.s 

SF Double c.sf.d 

NGLE Single c.ngle.s 

NGLE Double c.ngle.d 

SEQ Single c.deq.s 

SEQ Double c.seq.d 

NGL Single c.ngl.s 

NGL Double c.ngl.d 



Table A-4. Floating Point Instruction Summary (continued) 

NOTE: Starred items (*) are the most common Compare instructions. 
The machine has the other Compare instructions for IEEE compatibility. 
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Description 



Op-code Operand 



Compare Fp 
*LT Single 
*LT Double 

NGE Single 
NGE Double 



c.lt.s 
c.lt.d 

c.nge.s 
c.nge.d 



srcl,src2 



( 



*LE Single 
*LE Double 



c.le.s 
c.le.d 



NGT Single 
NGT Double 



c.ngt.s 
c.ngt.d 



Move Fp 

Single 
Double 



mov.s 
mov.d 



destination, srcl 



Table A-4. Floating Point Instruction Summary (continued) 



NOTE: Starred items (*) are the most common Compare instructions. 
The machine has the other Compare instructions for IEEE compatibility. 



( 



( 
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Appendix B: Basic Machine Definition 



The assembly language instructions described in this book are a superset of 
the actual machine instructions. Generally, the assembly language instruc- 
tions match the machine instructions; however, in some cases the assembly 
language instruction are macros that generate more than one machine in- 
struction (the assembly language multiplication instructions are examples) . 

You can, in most instances, consider the assembly instructions as machine 
instructions; however, for routines that require tight coding for perform- 
ance reasons, you must be aware of the assembly instructions that generate 
more than one machine language instruction, as described in this appen- 
dix. 



B.1 Load and Store Instructions 



If you use an address as an operand in an assembler Load or Store in- 
struction and the address references a data item that is not addressable 
through register $gp or the data item does not have an absolute address in 
the range -32768... 32767, the assembler instruction generates a lui (load 
upper immediate) machine instruction and generates the appropriate offset 
to Sat. The assembler then uses $at as the index address for the refer- 
ence. This condition occurs when the address has a relocatable external 
name offset (or index) from where the offset began. 

The assembler's la (load address) instruction generates an addiu (add un- 
signed immediate) machine instruction. If the address requires it, the la 
instruction also generates a lui (load upper immediate) machine instruc- 
tion. The machine requires the la instruction because la couples 
relocatable information with the instruction for symbolic addresses. 
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Depending on the expression's value, the assembler's li (load immediate) 
instruction can generate one or two machine instructions. For values in the 
-32768... 65535 range or for values that have zeros as the 16 least signifi- 
cant bits, the li instruction generates a single machine instruction;, other- 
wise it generates two machine instructions.. 



B.2 Computational Instructions 

If a computational instruction immediate value falls outside the 0... 65 53 5 
range for Logical ANDs, Logical ORs, or Logical XORs (exclusive or), the 
immediate field causes the machine to explicitly load a constant to a tem- 
porary register. Other instructions generate a single machine instruction 
when a value falls in the -32768... 32767 range. 

The assembler's seq (set equal) and sne (set not equal) instructions gener- 
ate three machine instructions each. 

If one operand is a literal outside the range -32768... 32767, the assem- 
bler's sge (set greater than or equal to) and sle (set less/equal) instruc- 
tions generate two machine instructions each. 

The assembler's mulo and mulou (multiply) instructions generate machine 
instructions to test for overflow and to move the result to a general regis- 
ter; if the destination register is $0, the check and move are not gener- 
ated. 

The assembler's mul (multiply unsigned) instruction generates a machine 
instruction to move the result to a general register; if the destination regis- 
ter is $0, the move and divide-by-zero checking is not generated. The 
assembler's divide instructions, div (divide with overflow) and divu (divide 
without overflow) , generate machine instructions to check for division by 
zero and to move the quotient into a general register; if the destination 
register is $0, the move is not generated. 

The assembler's rem (signed) and remu (unsigned) instructions also gen- 
erate multiple instructions. 

The rotate instructions ror (rotate right) and rol (rotate left) generate 
three machine instructions each. 

The abs (absolute value) instruction generates three machine instructions. 



( 



( 



c 
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B.3 Branch Instructions 



If the immediate value is not zero, the branch instructions beq (branch on 
equal) and bne (branch on not equal), each generate a load literal ma- 
chine instruction. The relational instructions generate a sit (set less than) 
machine instruction to determine whether one register is less than or 
greater than another. Relational instructions can reorder the operands and 
branch on either zero or not zero as required to do an operation. 



B.4 Coprocessor Instructions 

For symbolic addresses, the coprocessor interface Load and Store instruc- 
tions, Icz (load coprocessor z) and scz (store coprocessor z) can generate 
a lui (load upper immediate) machine instruction. 



B.5 Special Instructions 

The assembler's break instruction packs the breakcode operand in unused 
register fields. An operating system convention determines the position. 
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